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MODULATORS OF BODY WEIGHT, CORRESPONDING NUCLEIC 
ACIDS ANJ> PROTEINS, AND DIAGNOSTIC AND THERAPEUTIC USES 

THEREOF 



The research leading to the present inventions was funded in part by Grant No. 
5 DK 41096 from the National Institutes of Health. The government may have 
certain rights in the invention. 

RELATED APPLICATIONS 

The present application is a continuation-in-part of copending application Serial 
No. 08/292,345, filed August 17, 1994, of which the instant application claims the 
10 benefit of the filing date pursuant to 35 U.S. C. § 120, and which is incorporated 
herein by reference in its entirety. 

TECHNICAL FIELD OF THE INVENTION 

The present invention relates generally to the control of body weight of mammals 
including animals and humans, and more particularly to materials identified herein 
15 as modulators of weight, and to the diagnostic and therapeutic uses to which such 
modulators may be put. 

BACKGROUND OF THE INVENTION 



Obesity, defined as an excess of body fat relative to lean body mass, is associated 
with important psychological and medical morbidities, the latter including 

20 hypertension, elevated blood lipids, and Type II or non-insulin-dependent diabetes 
melitis (NIDDM). There are 6-10 million individuals with NIDDM in the U.S., 
including 18% of the population of 65 years of age (Harris et aL, 1987). 
Approximately 45% of males and 70% of females with NIDDM are obese, and 
their diabetes is substantially improved or eliminated by weight reduction (Harris, 

25 1991). As described below, both obesity and NIDDM are strongly heritable, 
though the predisposing genes have not been identified. The molecular genetic 



basis of these metabolically related disorders is an important, poorly understood 
problem. 

The assimilation, storage, and utilization of nutrient energy constitute a complex 
homeostatic system central to survival of metazoa. Among land-dwelling 
mammals, storage in adipose tissue of large quantities of metabolic fuel as 
triglycerides is crucial for surviving periods of food deprivation. The need to 
maintain a fixed level of energy stores without continual alterations in the size and 
shape of the organism requires the achievement of a balance between energy intake 
and expenditure. However, the molecular mechanisms that regulate energy 
balance remain to be elucidated. The isolation of molecules that transduce 
nutritional information and control energy balance will be critical to an 
understanding of the regulation of body weight in health and disease. 

An individual's level of adiposity is, to a large extent, genetically determined. 
Examination of the concordance rates of body weight and adiposity amongst mono- 
and dizygous twins or adoptees and their biological parents have suggested that the 
heritability of obesity (0.4-0.8) exceeds that of many other traits commonly 
thought to have a substantial genetic component, such as schizophrenia, 
alcoholism, and atherosclerosis (Stunkard et aL, 1990). Familial similarities in 
rates of energy expenditure have also been reported (Bogardus et aL 9 1986). 
Genetic analysis in geographically delimited populations has suggested that a 
relatively small number of genes may account for the 30% -50% of variance in 
body composition (Moll et al., 1991). However, none of the genes responsible for 
obesity in the general population have been genetically mapped to a definite 
chromosomal location. 

Rodent models of obesity include seven apparently single-gene mutations. The 
most intensively studied mouse obesity mutations are the ob (obese) and db 
(diabetes) genes. When present on the same genetic strain background, ob and db 
result in indistinguishable metabolic and behavioral phenotypes, suggesting that 
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these genes may function in the same physiologic pathway (Coleman, 1978). Mice 
homozygous for either mutation are hyperphagic and hypometabolic, leading to an 
obese phenotype that is notable at one month of age. The weight of these animals 
tends to stabilize at 60-70 g (compared with 30-35 g in control mice), ob and db 
5 animals manifest a myriad of other hormonal and metabolic changes that have 
made it difficult to identify the primary defect attributable to the mutation (Bray et 
al. 9 1989). 

Each of the rodent obesity models is accompanied by alterations in carbohydrate 
metabolism resembling those in Type II diabetes in man. In some cases, the 

10 severity of the diabetes depends in part on the background mouse strain (Leiter, 
1989). For both ob and db, congenic C57BL/Ks mice develop a severe diabetes 
with ultimate )8 cell necrosis and islet atrophy, resulting in a relative insulinopenia. 
Conversely, congenic C57BL/6J ob and db mice develop a transient insulin- 
resistant diabetes that is eventually compensated by 0 cell hypertrophy resembling 

15 human Type II diabetes. 

The phenotype of ob and db mice resembles human obesity in ways other than the 
development of diabetes - the mutant mice eat more and expend less energy than 
do lean controls (as do obese humans). This phenotype is also quite similar to that 
seen in animals with lesions of the ventromedial hypothalamus, which suggests 

20 that both mutations may interfere with the ability tcrproperly integrate or respond 
to nutritional information within the central nervous system. Support for this 
hypothesis comes from the results of parabiosis experiments (Coleman, 1973) that 
suggest ob mice are deficient in a circulating satiety factor and that db mice are 
resistant to the effects of the ob factor (possibly due to an ob receptor defect). 

25 These experiments have led to the conclusion that obesity in these mutant mice 
may result from different defects in an afferent loop and/or integrative center of 
the postulated feedback mechanism that controls body composition. 



Using molecular and classical genetic markers, the ob and db genes have been 
mapped to proximal chromosome 6 and midchromosome 4, respectively (Bahary et 
aL, 1990; Friedman et al., 1991b). In both cases, the mutations map to regions 
of the mouse genome that are syntonic with human, suggesting that, if there are 
human homologs of ob and db, they are likely to map, respectively, to human 
chromosomes 7q and lp. Defects in the db gene may result in obesity in other 
mammalian species: in genetic crosses between Zucker falfa rats and Brown 
Norway +/+ rats, the fa mutation (rat chromosome 5) is flanked by the same loci 
that flank db in mouse (Truett et aL, 1991). 

Because of the myriad factors that seem to impact body weight, it is difficult to 
speculate as to which of these factors, and more particularly, which homeostatic 
mechanism is actually primarily determinative. Nonetheless, the apparent 
connection between the ob gene and the extent and characteristics of obesity have 
prompted the further investigation and elucidation that is reflected by the present 
application. It is the identification of the sequence of the gene and corresponding 
peptide materials, to which the present invention following below directs itself. 

The citation of any reference herein should not be construed as an admission that 
such reference is prior art to the instant invention. Full citations of references 
cited by author and year are found at the end of the specification. 

SUMMARY OF THE INVENTION 

In its broadest aspect, the present invention relates to the elucidation and discovery 
of nucleotide sequences, and proteins putatively expressed by such nucleic acids or 
degenerate variations thereof, that demonstrate the ability to participate in the 
control of mammalian body weight. The nucleotide sequences in object are 
believed to represent the genes corresponding to the murine and human ob gene, 
that is postulated to play a critical role in the regulation of body weight and 
adiposity. Data presented herein indicates that the polypeptide product of the gene 



in question is secreted by the cells that express it and that the polypeptide 
functions as a hormone. 

In a first instance, the modulators of the present "invention comprise nucleic acid 
molecules, including recombinant DNA molecules (e.g., cDNA or a vector 
containing the cDNA or isolated genomic DNA) or cloned genes (i.e. isolated 
genomic DNA), or degenerate variants thereof, which encode polypeptides 
themselves serving as modulators of weight control as hereinafter defined, or 
conserved variants or fragments thereof, particularly such fragments lacking the 
signal peptide (alternatively referred to herein as mature ob polypeptide), which 
polypeptides possess amino acid sequences such as set forth in FIGURE 1 (SEQ 
ID NO:2), FIGURE 3 (SEQ ID NO:4), FIGURE 5 (SEQ ID NO:5) and FIGURE 
6 (SEQ ID NO: 6). In specific embodiments, amino acid sequences for two 
variants of murine and human ob polypeptides are provided. Both polypeptides 
are found in a form with glutamine 49 deleted, which may result from an mRNA 
splicing anomaly. The ob polypeptides from various species may be highly 
homologous; as shown in Figure 4, murine and human ob polypeptides are greater 
than 80% homologous. 

The nucleic acid molecules, recombinant DNA molecules, or cloned genes, may 
have the nucleotide sequences or may be complementary to DNA coding sequences 
shown in FIGURE 1 (SEQ ID NO:l) and FIGURE 2 (SEQ ID NO:3). In 
particular, such DNA molecules can be cDNA or genomic DNA isolated from the 
chromosome. Nucleic acid molecules of the invention may also correspond to 5' 
and 3' flanking sequences of the DNA. Accordingly, the present invention also 
relates to the identification of a gene having a nucleotide sequence selected from 
the sequences of Figure 1 (SEQ ID NO:l) and Figure 2 (SEQ ID NO:3) herein, 
and degenerate variants, allelic variations, and like cognate molecules. 
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A nucleic acid molecule of the invention can be DNA or RNA, including synthetic 
variants thereof having phosphate or phosphate analog, e.g. , thiophosphate, bonds. 
Both single stranded and double stranded sequences are contemplated herein. 

The present invention further provides nucleic acid molecules for use as molecular 
5 probes, or as primers for polymerase chain reaction (PCR) amplification, i.e., 
synthetic or natural oligonucleotides having a sequence corresponding to a portion 
of the sequences shown in Figure 1 (SEQ ID NO:l), Figure "2 (SEQ ID NO:3) and 
Figure 20A (SEQ ID NO:22), or the 5' and 3' flanking sequences of the coding 
sequences. In particular, the invention contemplates a nucleic acid molecule 

10 having at least about 10 nucleotides, wherein a sequence of the nucleic acid 

molecule corresponds to a nucleotide sequence of the same number of nucleotides 
in the nucleotide sequences of Figure 1 (SEQ ID NO:l), Figure 2 (SEQ ID NO:3) 
and Figure 20A (SEQ ID NO: 22), or a sequence complementary thereto. More 
preferably, the nucleic acid sequence of the molecule has at least 15 nucleotides. 

15 Most preferably, the nucleic acid sequence has at least 20 nucleotides. In an 
embodiment of the invention in which the oligonucleotide is a probe, the 
oligonucleotide is detectably labeled, e.g., with a radionuclide (such as 32 P), or an 
enzyme. 

In further aspects, the present invention provides a cloning vector, which 
20 comprises the nucleic acids of the invention that encode the ob polypeptide; and a 
bacterial, insect, or a mammalian expression vector, which comprises the nucleic 
acid molecules of the invention encoding the ob polypeptide, operatively associated 
with an expression control sequence. Accordingly, the invention further relates to 
a host cell, such as a bacterial cell, yeast cell, insect cell, or a mammalian cell, 
25 transfected or transformed with an appropriate expression vector, and 

correspondingly, to the use of the above mentioned constructs in the preparation of 
the modulators of the invention. 



In yet a further aspect, the present invention relates to antibodies that bind to the 
ob polypeptide. Such antibodies may be generated against the full length 
polypeptide, or antigenic fragments thereof. In one aspect, such antibodies inhibit 
the functional (i.e., body weight and fat composition modulating) activity of the ob 
polypeptide. 

All of the foregoing materials are to be considered herein as modulators of body 
weight and fat composition, and as such, may be used in a Variety of contexts. 
Specifically, the invention contemplates both diagnostic and therapeutic 
applications, as well as certain agricultural applications, all contingent upon the 
use of the modulators defined herein, including both nucleic acid molecules and 
peptides. Moreover, the modulation of body weight carries specific therapeutic 
implications and benefits, in that conditions where either obesity or, conversely, 
cachexia represent undesired bodily conditions, can be remedied by the 
administration of one or more of the modulators of the present invention. 

Thus, a method for modulating body weight of a mammal is proposed that 
comprises controlling the expression of the protein encoded by a nucleic acid 
having nucleotide sequence selected from the sequence of Figure 1 (SEQ ID 
NO:l), the sequence of Figure 2 (SEQ ID NO: 3) and degenerate and allelic 
variants thereof. Such control may be effected by the introduction of the 
nucleotides in question by gene therapy into fat cells of the patient or host to 
control or reduce obesity. Conversely, the preparation and administration of 
antagonists to the nucleotides, such as anti-sense molecules, would be indicated 
and pursued in the instance where conditions involving excessive weight loss, such 
as anorexia nervosa, cancer, or AIDS are present and under treatment. Such 
constructs would be introduced in similar fashion to the nucleotides, directly into 
fat cells to effect such changes. 

Correspondingly, the proteins defined by Figures 1, 3, 5, and 6 (SEQ ID NO:l, 
SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6), conserved variants, active 



) ) 
8 

fragments thereof, and cognate small molecules could be formulated for direct 
administration for therapeutic purposes, to effect reduction or control of excessive 
body fat or weight gain. Correspondingly, antibodies and other antagonists to the 
stated protein materials, such as fragments thereof, could be prepared and 
5 similarly administered to achieve the converse effect. Accordingly, the invention 
is advantageously directed to a pharmaceutical composition comprising an ob 
polypeptide of the invention, or alternatively an antagonist thereof, in an 
admixture with a pharmaceutical^ acceptable carrier or excipient. 

The diagnostic uses of the present nucleotides and corresponding peptides extend 
to the use of the nucleotides to identify further mutations of allelic variations 
thereof, so as to develop a repertoire of active nucleotide materials useful in both 
diagnostic and therapeutic applications. In particular, both homozygous and 
heterozygous mutations of the nucleotides in question could be prepared that would 
be postulated to more precisely quantitate the condition of patients, to determine 
the at-risk potential of individuals with regard to obesity. Specifically, 
heterozygous mutations are presently viewed as associated with mild to moderate 
obesity, while homozygous mutations would be associated with a more pronounced 
and severe obese condition. Corresponding DNA testing could then be conducted 
utilizing the aforementioned ascertained materials as benchmarks, to facilitate an 
accurate long term prognosis for particular tendencies, so as to be able to 
prescribe changes in either dietary or other personal habits, or direct therapeutic 
intervention, to avert such conditions. 

The diagnostic utility of the present invention extends to methods for measuring 
the presence and extent of the modulators of the invention in cellular samples or 
25 biological extracts (or samples) taken from test subjects, so that both the encoded 
nucleotide (genomic DNA or RNA) and or the levels of protein in such test 
samples could be ascertained. Given that the increased activity of the nucleotide 
and presence of the resulting protein reflect the capability of the subject to inhibit 
obesity, the physician reviewing such results in an obese subject would determine 
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that a factor other than dysfunction with respect to the presence and activity of the 
nucleotides of the present invention is a cause of the obese condition. Conversely, 
depressed levels of the nucleotide and or the expressed protein would suggest that 
such levels must be increased to treat such obese condition, and an appropriate 
5 therapeutic regimen could then be implemented. 

Further, the nucleotides discovered and presented in Figures 1 and 2 represent 
cDNA in which, as stated briefly above, is useful in the measurement of 
corresponding RNA. Likewise, recombinant protein material corresponding to the 
polypeptides of Figures 1 and 3 may be prepared and appropriately labeled, for 
use, for example, in radioimmunoassays, for example, for the purpose of 
measuring fat and/or plasma levels of the ob protein, or for detecting the presence 
and level of a receptor for ob on tissues, such as the hypothalamus. 

Yet further, the present invention contemplates not only the identification of the 
nucleotides and corresponding proteins presented herein, but the elucidation of the 
receptor to such materials. In such context, the polypeptides of Figures 1, 3, 5, 
and/or 6 could be prepared and utilized to screen an appropriate expression library 
to isolate active receptors. The receptor could thereafter be cloned, and the 
receptor alone or in conjunction with the ligand could thereafter be utilized to 
screen for small molecules that may possess like activity to the modulators herein. 

20 Yet further, the present invention relates to pharmaceutical compositions that 
include certain of the modulators hereof, preferably the polypeptides whose 
sequences are presented in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:5 and SEQ 
ID NO: 6, their antibodies, corresponding small molecule agonists or antagonists 
thereof, or active fragments prepared in formulations for a variety of modes of 

25 administration, where such therapy is appropriate. Such formulations would 
include pharmaceutical^ acceptable carriers, or other adjuvants as needed, and 
would be prepared in effective dosage ranges to be determined by the clinician or 
the physician in each instance. 
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Accordingly, it is a principal object of the present invention to provide modulators 
of body weight as defined herein in purified form, that exhibit certain 
characteristics and activities associated with control and variation of adiposity and 
fat content of mammals. 

It is a further object of the present invention to provide methods for the detection 
and measurement of the modulators of weight control as set forth herein, as a 
means of the effective diagnosis and monitoring of pathological conditions wherein 
the variation in level of such modulators is or may be a characterizing feature. 

It is a still further object of the present invention to provide a method and 
associated assay system for the screening of substances, such as drugs, agents and 
the like, that are potentially effective to either mimic or inhibit the activity of the 
modulators of the invention in mammals. 

It is a still further object of the present invention to provide a method for the 
treatment of mammals to control body weight and fat content in mammals, and or 
to treat certain of the pathological conditions of which abnormal depression or 
elevation of body weight is a characterizing feature. 

It is a still further object of the present invention to prepare genetic constructs for 
use in genetic therapeutic protocols and or pharmaceutical compositions for 
comparable therapeutic methods, which comprise or are based upon one or more 
of the modulators, binding partners, or agents that may control their production, 
or that may mimic or antagonize their activities. 

Other objects and advantages will become apparent to those skilled in the art from 
a review of the ensuing description which proceeds with reference to the following 
illustrative drawings. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



FIGURE 1 depicts the nucleic acid sequence (SEQ ID NO: 1) and deduced amino 
acid sequence (SEQ ID NO:2) derived for the murine ob cDNA. A 97 base pair 
5' leader was followed by a predicted 167 amino acid open reading frame and an 
approximately 3700 kB 3' untranslated sequence. A total of about 2500 base pairs 
of the 3' unstranslated sequence is shown. Analysis of the predicted protein 
sequence by observation and using the SigSeq computer program indicates the 
presence of a signal sequence (underlined). Microheterogeneity of the cDNA was 
noted in that approximately 70% of the cDNAs had a glutamine codon at codon 49 
and 30% did not (see FIGURES 5 and 6, infra). This amino acid is Underlined, 
as is the arginine codon that is mutated in C57BL/6J ob/ob mice (1J mice). 

FIGURE 2 depicts the nucleic acid sequence (SEQ ID NO: 3) derived for the 
human ob cDNA. The nucleotides are numbered from 1 to 701 with a start site at 
nucleotide 46 and a termination at nucleotide 550. 

FIGURE 3 depicts the full deduced amino acid sequence (SEQ ID NO:4) derived 
for the human ob gene corresponding to the nucleic acid sequence of FIGURE 2. 
The amino acids are numbered from 1 to 167. A signal sequence cleavage site is 
located after amino acid 21 (Ala) so that the mature protein extends from amino 
acid 22 (Val) to amino acid 167 (Cys). 

FIGURE 4 depicts the comparison between the murine (SEQ ID NO:2) and 
human (SEQ ID NO:4) deduced amino acid sequences. The sequence of the human 
ob deduced amino acid sequence was highly homologous to that of mouse. 
Conservative changes are noted by a dash, and non-conservative changes by an 
asterisk. The variable glutamine codon is underlined, as is the position of the 
nonsense mutation in C57BL/6J ob/ob (1J) mice. Overall, there is 84% identity at 
the amino acid level, although only six substitutions were found between the valine 
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at codon 22 (immediately downstream of the signal sequence overage) and the 
cysteine at position 117. 

FIGURE 5 depicts the full length amino acid sequence (SEQ ID NO:5) derived 
for the murine ob gene as shown in FIGURE 3, but lacking glutamine at position 
49. The nucleotides are numbered from 1 to 166. A signal sequence cleavage site 
is located after amino acid 21 (Ala) (and thus, before the glutamine 49 deletion) so 
that the mature protein extends from amino acid 22 (Val) to amino acid 166 (Cys). 

FIGURE 6 depicts the full deduced amino acid sequence (SEQ ID NO: 6) derived 
for the human ob gene as shown in FIGURE 4, but lacking glutamine at position 
49. The nucleotides are numbered from 1 to 166. A signal sequence cleavage site 
is located after amino acid 21 (Ala) (and thus, before the glutamine 49 deletion) so 
that the mature protein extends from amino acid 22 (Val) to amino acid 166 (Cys). 

FIGURE 7. (A) Physical map of the location of ob in the murine chromosome, 
and the YAC and PI cloning maps. "M and N" corresponds to MuR and Noil 
restriction sites. The numbers correspond to individual animals that were 
recombinant in the region of ob of the 1606 meioses that were scored. Met, Pax 
4, D6Rck39, D6Rckl3, and Cpa refer to locations in the region of ob that bind to 
the DNA probes. YACs were isolated using D6Rckl3 and Pax-4 as probes, and 
the ends were recovered using vectorette PCR and/or plasmid end rescue and used 
in turn to isolate new YACs. (B) The resulting YAC contig. One of the YACs in 
this contig, Y902A0925, was chimeric. Each of the probes used to genotype the 
recombinant animals is indicated in parentheses. (6) Corresponds to YAC 107; (5) 
corresponds to M16(+) (or M16(pLUS)); (4) corresponds to adu(+)\ (3) 
corresponds to aad($\CL)\ (2) corresponds to 53(pICL); and (1) corresponds to 
53(+). (C) The PI contig of bacteriophage PI clones isolated with selected YAC 
end probes. The ob gene was isolated in a PI clone isolated using the distal end 
of YAC YB6S2F12 (end (4)) (alternatively termed herein adu(+)). 
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FIGURE 8 presents a photograph of an ethidium bromide stain of 192 independent 
isolates of the fourth exon trapping experiment that were PCR amplified and 
characterized. 

FIGURE 9 is a photograph of an ethidium bromide stain of PCR-amplified clones 
5 suspected of carrying ob. Each of the 7 clones that did not carry the artifact was 
reamplified using PCR and electrophoresed on a 1 % agarose gel in TBE and 
stained with ethidium bromide. The size markers (far left unnumbered lane) are 
the commercially available "1 kB ladder". Lane 1 — clone 1D12, containing an 
"HIV sequence." Lane 2 -- clone 1F1, a novel clone outside of the ob region. 
10 Lane 3 — clone 1H3. Lane 4 — clone 2B2, which is the identical to 1F1. Lane 
5 — clone 2G7, which contains an ob exon. Lane 6 — clone 2G11, which is 
identical to 1F1. Lane 7 — clone 2H1, which does not contain an insert. 

FIGURE 10 presents the sequence of the 2G7 clone (SEQ ID NO:7), which 
includes an exon coding for a part of the ob gene. The primer sequences used to 
15 amplify this exon are boxed in the figure (SEQ ID NOS:8 and 9). 



M= FIGURE 11. (A) Reverse transcription-PCR analysis of mRNA from different 

K tissues of the same mouse with the 2G7 primers and actin primers. The RT-PCR 

j=t:=. 

reactions were performed using 100 ng of total RNA reverse transcribed with 
oligo dT as a primer for first strand cDNA. PCR amplification was performed for 
20 35 cycles with 94° denaturation x 1'; 55° hybridization x 1'; and 72 extensions 
for 2' with a V second autoextension per cycle. RT-PCR products were resolved 
in a 2% low melting point agarose gel run in lx TBE buffer. (B) Northern blot of 
mRNA from different organs of the mouse using PCR labeled 2G7 as a probe. 
Ten fig of total RNA from each of the tissues was electrophoresed on an agarose 
25 gel with formaldehyde. The probe was hybridized at 65 °C in Rapid Hybe 

(Amersham). Autoradiographic signals were apparent after 1 hour of exposure; 
the experiment shown was the result of a 24 hour exposure. 
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FIGURE 12. (A) An ethidium bromide stain from an RT PCR reaction on fat 
cell (white adipose tissue) RNA from each of the mouse strains listed. Total RNA 
(100 ng) for each sample was reverse transcribed using oligo dT and reverse 
transcriptase, and the resulting single stranded cDNA was PCR amplified with the 
5 2G7 primers (lower bands) or actin primers (upper bands). Both the 2G7 and 
actin primers were included in the same PCR reaction. The products were run on 
a 1% agarose TBE gel. (B) Northern analysis corresponding to (A). Ten /xg of 
fat cell (white adipose tissue) RNA from each of the strains indicated were run out 
and probed with the PCR labeled 2G7 probe as in Figure 11B, above. An 

10 approximately 20- fold increase in the level of 2G7 mRNA was apparent in white 
fate RNA from the C57BL/6J ob/ob (1J) strain relative to lean litterm^tes. In both 
the RT-PCR and Northern experiments there was no detectable signal in 2G7 RNA 
from the SM/Ckc-+ Dac ob 2J /ob 2J (2J) mice even after a 2 week exposure. A 24 
hour autoradiographic exposure is shown. The same filter was hybridized to an 

15 actin probe (bottom portion of the panel). 

FIGURE 13 is a Northern analysis of additional 2J animals and control animals 
that confirms the absence of the ob mRNA from 2 J animals. The Northern 
analysis was performed as in Figures 11 and 12. In this case, the control RNA 
was ap2, a fat specific transcript. There is no significance to the varying density 
20 of the ap2 bands. 



FIGURE 14 compares the DNA sequence of the C57BL/6J (normal) and the 
C57BL/6J ob/ob (1J) mice in the region of the point mutation that leads to 
introduction of a premature stop codon (nonsense mutation) in the mutant strain 
cDNA. The ob/ob mice had a C-»T mutation that changed an arginine residue at 
25 position 105. This base change is shown as the output from the automated DNA 
sequencer. RT-PCR was performed using white fat RNA from both strains (+/+ 
and ob/ob) using primers from the 5' and 3' unstranslated regions. The PCR 
reaction products were gel purified and directly sequenced manually and using an 
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ABI 373A automated sequencer with primers along both strands of the coding 
sequence. 
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FIGURE 15. (A) Genomic southern blot of genomic DNA from each of the 
mouse strains listed. Approximately 5 \ig of DNA (derived from genomic DNA 
5 prepared from liver, kidney or spleen) was restriction digested with the restriction 
enzyme indicated. The DNA was then electrophoresed in a 1% agarose TBE gel 
and probed with PCR labeled 2G7. Restriction digestion with BgRl revealed an 
increase in the size of an approximately 9 kB (the largest) BgRl fragment in 
SM/Ckc-H-Dacob^/ob 21 (2 J) DNA. RFLPs were not detectable with any other 
10 restriction enzymes. Preliminary restriction mapping of genomic DNA indicated 
^ that the polymorphic BgRl site is about 7 kB upstream of the transcription start 

Jj site. None of the other enzymes tested extend past the mRNA start site. (B) 

Cm 

p Segregation of a BgRl polymorphism in the SM/CkC'+Dacob^/ob 21 strain. Six 

obese and five lean progeny from the same generation of the coisogenic 
15 SM/Ckc-H-^ob^/ob 2 ' (IS) colony were genotyped by scoring the BgRl 

polymorphism as shown in (A). All of the phenotypically obese animals were 
homozygous for the larger allele of the polymorphic Bgl fragment. The DNA in 
the "control" lane was prepared from an unrelated SM/Ckc-+ Dac +/+ mouse, bred 
separately from the SM/Ckc-H-^ob^/ob 21 colony. 

20 FIGURE 16 is a Southern blot of EcoRI digested genomic DNA from the species 
listed, using an ob cDNA as a probe (i.e., a zoo blot). Hybridization signals were 
detectable in every vertebrate sample, even after a moderate stringency 
hybridization. The cat DNA in this experiment was slightly degraded. The 
restricted DNA was run on a 1% agarose TBE gel, and transferred to an imobilon 
25 membrane for probing. The filter was hybridized at 65 °C and washed in 2X 
SSC/0.2% SDS at 65 °C twice for twenty minutes and exposed for 3 days using 
Kodak X-OMAT film. 



FIGURE 17 presents the expression cloning region of vector pET-15b (Novagen). 
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FIGURE 18 presents analysis of the eluate from a His-binding resin (Ni) column 
for a recombinant mature murine ob fusion to a His-tag (A) and mature human ob 
fusion to a His-tag (B). Bacteria transformed with vectors pETM9 and pETH14, 
respectively. Upon induction with 1 mM IPTG at optimal conditions, the 
5 transformed bacteria were able to produce 100-300 /xg/ml of ob fusion protein, 
primarily in the inclusion body. The inclusion body was solubilized with 6M 
guanidine-HCl or urea, and fusion protein (present in the lysis supernatant) was 
loaded on the His-binding resin (Ni) column in 10 ml of lx binding buffer with 
urea. The column was eluted stepwise with 5 ml aliquots of 20 fiM, 60 /xM, and 
10 300 /xM imidazole, and finally with strip buffer. The aliquots were analyzed for 
the presence of ob polypeptide fusion on a 15% acrylamide gel. Each lane 
contains the equivalent of 100 fil of bacterial extract. 

FIGURE 19. (A) In vitro translation of ob RNA. A human ob cDNA was 
subcloned into the pGEM vector. The plasmid was linearized and plus strand 
15 RNA was synthesized using sp6 polymerase. The in vitro synthesized RNA was 
translated in the presence or absence of canine pancreatic microsomal membranes. 
An approximately 18 kD primary translation product was seen after in vitro 
translation. The addition of microsomal membranes to the reaction led to the 
appearance of a second translation product about 2 kD smaller than the primary 
20 translation product. The size of the translation product of interleukin-la RNA, 
which lacks an encoded signal sequence, was unchanged by the addition of 
microsomal membranes. These data indicated the presence of a functional signal 
sequence. (B) In vitro translation in the presence or absence of proteinase K. 
Protease treatment resulted in complete proteolysis of the 18 kD primary 
25 translation product, while the 16 kD processed form was unaffected. 

Permeabilization of the microsome with 0.1% TRITON-X100 rendered the 
<A«ifo* processed Stem protease sensitive. These results indicate that the product had 
translated into the lumen of the microsome. 
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FIGURE 20. (A) The sequence of the human ob gene (SEQ ID NO:22). (B) A 
schematic diagram of the murine ob gene. (C) A schematic diagram of the human 
ob gene. In both (B) and (C), the start and stop codons are underlined. There is 
no evidence of a first intron homologous to the mouse first intron in the human 
gene, but its existence cannot be excluded. 

FIGURE 21 presents a schematic drawing of one of the cloning strategies 
employed to achieve recombinant expression of ob in pichia yeast. (A) Expression 
vector of ob with an or-mating factor signal sequence. (B) Schematic drawing of 
the structure of the recombinant fusion protein, including the amino acid sequence 
(SEQ ID NO:23) showing the Xhol site and putative KEX-2 and STE-13 cleavage 
sites, and the N-terminal surplus amino acids present after KEX-2 cleavage (SEQ 
ID NO: 24). (C) An alternative strategy for producing mature ob under involves 
preparing a construct with an amino acid sequence corresponding to a Xhol 
cleavage site and a KEX-2 cleavage site immediately upstream of the mature ob 
polypeptide sequence (SEQ ID NO:25). 

FIGURE 22 Alternative expression strategy in pichia. (A) Expression vector of 
an ob fusion with a His tag adopted from the pET expression system under control 
of the a-mating factor signal sequence. (B) Schematic drawing of the structure of 
the recombinant ob fusion protein containing a His tag, which includes the a- 
mating factor signal sequence, putative KEX-2 and STE-13 cleavage sites, the His- 
tag, and a thrombin cleavage site, and which would yield ob with three surplus N- 
terminal amino acid residues. 

FIGURE 23. (A) PAGE analysis of expression of murine ob (both the 
microheterogenous forms, i.e., containing and missing Gin 49) in transformed 
pichia yeast. The expected band of approximately 16 kD is visible in the 
transformed yeast culture fluid (second and third lanes), but not in culture fluid 
from non- transformed yeast (first lane). (B) PAGE analysis of partially purified 
recombinant ob polypeptide on carboxymethyl cellulose, a weak cation exchanger. 
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A band of about 16 kD is very visible in fractions 3 and 4 from the column, which 
was eluted with 250 mM NaCl. Lane 1 — loaded sample; lane 2 — flow through; 
lanes 3-5 - fractions eluted with 250 mM NaCl. 

DETAILED DESCRIPTION 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the 
skill of the art. Such techniques are explained fully in the literature. See, e.g., 
Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, Second 
Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York (herein "Sambrook et al., 1989"); DNA Cloning: A Practical Approach, 
Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait 
ed. 1984); Nucleic Acid Hybridization [B.D. Hames & S.J. Higgins eds. (1985)]; 
Transcription And Translation [B.D. Hames & S.J. Higgins, eds. (1984)]; Animal 
Cell Culture [R.I. Freshney, ed. (1986)]; Immobilized Cells And Enzymes [IRL 
Press, (1986)]; B. Perbal, A Practical Guide To Molecular Cloning (1984). Of 
particular relevance to the present invention are strategies for isolating, cloning, 
sequencing, analyzing, and characterizing a gene or nucleic acid based on the well 
known polymerase chain reaction (PCR) techniques. 

Therefore, if appearing herein, the following terms shall have the definitions set 
20 out below. 

The term "body weight modulator", "modulator", "modulators", and any variants 
not specifically listed, may be used herein interchangeably, and as used throughout 
the present application and claims refers in one instance to both nucleotides and to 
proteinaceous material, the latter including both single or multiple proteins. More 
25 specifically, the aforementioned terms extend to the nucleotides and to the DNA 
having the sequences described herein and presented in Figure 1 (SEQ ID NO:l), 
and Figure 2 (SEQ ID NO:3). Likewise, the proteins having the amino acid 
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sequence data described herein and presented in Figure 1 (SEQ ID NO:2), and 
Figure 3 (SEQ ID NO:4) are likewise contemplated, as are the profile of activities 
set forth with respect to all materials both herein and in the claims. Accordingly, 
nucleotides displaying substantially equivalent or altered activity are likewise 
contemplated, including substantially homologous analogs and allelic variations. 
Likewise, proteins displaying substantially equivalent or altered activity, including 
proteins modified deliberately, as for example, by site-directed mutagenesis, or 
accidentally through mutations in hosts that produce the modulators are likewise 
contemplated. 

The terms "protein, " which refers to the naturally occurring polypeptide, and 
"polypeptide" are used herein interchangeably with respect to the ob gene product 
and variants thereof. The term "mature protein" or "mature polypeptide" refers to 
the ob gene product with the signal sequence (or a fusion protein partner) 
removed. 

A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that 
functions as an autonomous unit of DNA replication in vivo, i.e., capable of 
replication under its own control. 

A "vector" is a replicon, such as a plasmid, phage or cosmid, to which another 
DNA segment may be attached so as to bring about the replication of the attached 
segment. 

A "cassette" refers to a segment of DNA that can be inserted into a vector at 
specific restriction sites. The segment of DNA encodes a polypeptide of interest, 
and the cassette and restriction sites are designed to ensure insertion of the cassette 
in the proper reading frame for transcription and translation. 
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"Heterologous" DNA refers to DNA not naturally located in the cell, or in a 
chromosomal site of the cell. Preferably, the heterologous DNA includes a gene 
foreign to the cell. 

A cell has been " trans fected" by exogenous or heterologous DNA when such DNA 
5 has been introduced inside the cell. A cell has been "transformed" by exogenous 
or heterologous DNA when the transfected DNA effects a phenotypic change. 
Preferably, the transforming DNA should be integrated (covalently linked) into 
chromosomal DNA making up the genome of the cell. 

A "clone" is a population of cells derived from a single cell or common ancestor 
10 by mitosis. 

A "nucleic acid molecule" refers to the phosphate ester polymeric form of 
ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA molecules") or 
deoxy ribonucleos ides (deoxyadenosine, deoxyguanosine, deoxy thymidine, or 
deoxy cytidine; "DNA molecules") in either single stranded form, or a double- 

15 stranded helix. Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices 
are possible. The term nucleic acid molecule, and in particular DNA or RNA 
molecule, refers only to the primary and secondary structure of the molecule, and 
does not limit it to any particular tertiary or quaternary forms. Thus, this term 
includes double-stranded DNA found, inter alia, in linear or circular DNA 

20 molecules {e.g., restriction fragments), plasmids, and chromosomes. In discussing 
the structure of particular double-stranded DNA molecules, sequences may be 
described herein according to the normal convention of giving only the sequence in 
the 5' to 3' direction along the nontranscribed strand of DNA (i.e., the strand 
having a sequence homologous to the mRNA). A "recombinant DNA molecule" 

25 is a DNA molecule that has undergone a molecular biological manipulation. 



A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, such 
as a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic 
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acid molecule can anneal to the other nucleic acid molecule under the appropriate 
conditions of temperature and solution ionic strength (see Sambrook et al., supra). 
The conditions of temperature and ionic strength determine the "stringency" of the 
hybridization. For preliminary screening for homologous nucleic acids, low 
5 stringency hybridization conditions, corresponding to a T m of 55°, can be used, 
e.g., 5x SSC, 0.1% SDS, 0.25% milk, and no formamide; or 30% formamide, 5x 
SSC, 0.5% SDS). Moderate stringency hybridization conditions correspond to a 
higher T m , e.g. , 40% formamide, with 5x or 6x SCC. High stringency 
hybridization conditions correspond to the highest T ra , e.g., 50% formamide, 5x or 
10 6x SCC. Hybridization requires that the two nucleic acids contain complementary 
sequences, although depending on the stringency of the hybridization, mismatches 
Ct between bases are possible. The appropriate stringency for hybridizing nucleic 

Qi acids depends on the length of the nucleic acids and the degree of 

p complementation, variables well known in the art. The greater the degree of 

ffj 15 similarity or homology between two nucleotide sequences, the greater the value of 

%A T m for hybrids of nucleic acids having those sequences. The relative stability 

r = (corresponding to higher T^ of nucleic acid hybridizations decreases in the 

O following order: RNA:RNA, DNA:RNA, DNA:DNA. For hybrids of greater 

p than 100 nucleotides in length, equations for calculating T m have been derived (see 

2 20 Sambrook et al., supra, 9.50-0.51). For hybridization with shorter nucleic acids, 

i.e., oligonucleotides, the position of mismatches becomes more important, and the 
length of the oligonucleotide determines its specificity (see Sambrook et al., supra, 
11.7-11.8). Preferably a minimum length for a hybridizable nucleic acid is at 
least about 10 nucleotides; more preferably at least about 15 nucleotides; most 
25 preferably the length is at least about 20 nucleotides. 

"Homologous recombination" refers to the insertion of a foreign DNA sequence of 
a vector in a chromosome. Preferably, the vector targets a specific chromosomal 
site for homologous recombination. For specific homologous recombination, the 
vector will contain sufficiently long regions of homology to sequences of the 
30 chromosome to allow complementary binding and incorporation of the vector into 



} 
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the chromosome. Longer regions of homology, and greater degrees of sequence 
similarity, may increase the efficiency of homologous recombination. 

A DNA "coding sequence" is a double-stranded DNA sequence which is 
transcribed and translated into a polypeptide in a cell in vitro or in vivo when 
5 placed under the control of appropriate regulatory sequences. The boundaries of 
the coding sequence are determined by a start codon at the 5' (amino) terminus 
and a translation stop codon at the 3' (carboxyl) terminus. A coding sequence can 
include, but is not limited to, prokaryotic sequences, cDNA from eukaryotic 
mRNA, genomic DNA sequences from eukaryotic (e.g., mammalian) DNA, and 
10 even synthetic DNA sequences. If the coding sequence is intended for expression 
in a eukaryotic cell, a polyadenylation signal and transcription termination 
sequence will usually be located 3' to the coding sequence. 

Transcriptional and translational control sequences are DNA regulatory sequences, 
such as promoters, enhancers, terminators, and the like, that provide for the 
15 expression of a coding sequence in a host cell. In eukaryotic cells, 
polyadenylation signals are control sequences. 

A coding sequence is "under the control" of transcriptional and translational 
control sequences in a cell when RNA polymerase transcribes the coding sequence 
into mRNA, which is then trans-RNA spliced and translated into the protein 
20 encoded by the coding sequence. 

A "signal sequence" is included at the beginning of the coding sequence of a 
protein to be expressed on the surface of a cell. This sequence encodes a signal 
peptide, N-terminal to the mature polypeptide, that directs the host cell to 
translocate the polypeptide. The term "translocation signal sequence" is also used 
25 herein to refer to this sort of signal sequence. Translocation signal sequences can 
be found associated with a variety of proteins native to eukaryotes and 
prokaryotes, and are often functional in both types of organisms. 
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A DNA sequence is "operatively linked" to an expression control sequence when 
the expression control sequence controls and regulates the transcription and 
translation of that DNA sequence. The term "operatively linked" includes having 
an appropriate start signal (e.g., ATG) in front of the DNA sequence to be 
5 expressed and maintaining the correct reading frame to permit expression of the 
DNA sequence under the control of the expression control sequence and 
production of the desired product encoded by the DNA sequence. If a gene that 
one desires to insert into a recombinant DNA molecule does hot contain an 
appropriate start signal, such a start signal can be inserted upstream (5') of and in 
10 reading frame with the gene. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase in a cell and initiating transcription of a downstream (3' direction) 
coding sequence. For purposes of defining the present invention, the promoter 
sequence is bounded at its 3' terminus by the transcription initiation site and 
extends upstream (5' direction) to include the minimum number of bases or 
elements necessary to initiate transcription at levels detectable above background. 
Within the promoter sequence will be found a transcription initiation site 
(conveniently defined for example, by mapping with nuclease SI), as well as 
protein binding domains (consensus sequences) responsible for the binding of RNA 
polymerase. 

The term "standard hybridization conditions" refers to salt and temperature 
conditions substantially equivalent to 5 x SSC and 65 °C for both hybridization and 
wash. 

A molecule is "antigenic" when it is capable of specifically interacting with an 
25 antigen recognition molecule of the immune system, such as an immunoglobulin 
(antibody) or T cell antigen receptor. An antigenic polypeptide contains at least 
about 5, and preferably at least about 10, amino acids. An antigenic portion of a 
molecule can be that portion that is immunodominant for antibody or T cell 
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receptor recognition, or it can be a portion used to generate an antibody to the 
molecule by conjugating the antigenic portion to a carrier molecule for 
immunization. A molecule that is antigenic need not be itself immunogenic, i.e., 
capable of eliciting an immune response without a carrier. 

An "antibody" is any immunoglobulin, including antibodies and fragments thereof, 
that binds a specific epitope. The term encompasses polyclonal, monoclonal, and 
chimeric antibodies, the last mentioned described in further detail in U.S. Patent 
Nos. 4,816,397 and 4,816,567, as well as antigen binding portions of antibodies, 
including Fab, F(ab*) 2 and Fr (including single chain antibodies). Accordingly, 
the phrase "antibody molecule" in its various grammatical forms as used herein 
contemplates both an intact immunoglobulin molecule and an immunologically 
active portion of an immunoglobulin molecule containing the antibody combining 
site. An "antibody combining site" is that structural portion of an antibody 
molecule comprised of heavy and light chain variable and hypervariable regions 
that specifically binds antigen. 

Exemplary antibody molecules are intact immunoglobulin molecules, substantially 
intact immunoglobulin molecules and those portions of an immunoglobulin 
molecule that contains the paratope, including those portions known in the art as 
Fab, Fab', F(ab') 2 and F(v), which portions are preferred for use in the 
therapeutic methods described herein. 

Fab and F(ab') 2 portions of antibody molecules are prepared by the proteolytic 
reaction of papain and pepsin, respectively, on substantially intact antibody 
molecules by methods that are well-known. See for example, U.S. Patent No. 
4,342,566 to Theofilopolous et al. Fab' antibody molecule portions are also well- 
known and are produced from F(ab') 2 portions followed by reduction of the 
disulfide bonds linking the two heavy chain portions as with mercaptoethanol, and 
followed by alkylation of the resulting protein mercaptan with a reagent such as 
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iodoacetamide. An antibody containing intact antibody molecules is preferred 
herein. 

The phrase "monoclonal antibody" in its various grammatical forms refers to an 
antibody having only one species of antibody combining site capable of 
5 immunoreacting with a particular antigen. A monoclonal antibody thus typically 
displays a single binding affinity for any antigen with which it immunoreacts. A 
monoclonal antibody may therefore contain an antibody molecule having a 
plurality of antibody combining sites, each immunospecific for a different antigen; 
e.g., a bispecific (chimeric) monoclonal antibody. 



s** 10 A composition comprising "A" (where "A" is a single protein, DNA molecule, 

-J3 vector, recombinant host cell, etc.) is substantially free of "B" (where "B" 

CP 

ED comprises one or more contaminating proteins, DNA molecules, vectors, etc., but 

CP 

m excluding racemic forms of A) when at least about 75% by weight of the proteins, 
DNA, vectors (depending on the category of species to which A and B belong) in 

e 15 the composition is "A". Preferably, "A" comprises at least about 90% by weight 

%Z_ of the A+B species in the composition, most preferably at least about 99% by 

H; weight. It is also preferred that a composition, which is substantially free of 

□ contamination, contain only a single molecular weight species having the activity 

w or characteristic of the species of interest. 



20 The phrase "pharmaceutical^ acceptable" refers to molecular entities and 

compositions that are physiologically tolerable and do not typically produce an 
allergic or similar untoward reaction, such as gastric upset, dizziness and the like, 
when administered to a human. Preferably, as used herein, the term 
"pharmaceutical^ acceptable" means approved by a regulatory agency of the 

25 Federal or a state government or listed in the U.S. Pharmacopeia or other 

generally recognized pharmacopeia for use in animals, and more particularly in 
humans. The term "carrier" refers to a diluent, adjuvant, excipient, or vehicle 
with which the compound is administered. Such pharmaceutical carriers can be 
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sterile liquids, such as water and oils, including those of petroleum, animal, 
vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame 
oil and the like. Water or aqueous solution saline solutions and aqueous dextrose 
and glycerol solutions are preferably employed as carriers, particularly for 
injectable solutions. Suitable pharmaceutical carriers are described in 
"Remington's Pharmaceutical Sciences" by E.W. Martin. 

The phrase "therapeutically effective amount" is used herein to mean an amount 
sufficient to reduce by at least about 15 percent, preferably by at least 50 percent, 
more preferably by at least 90 percent, and most preferably prevent, a clinically 
significant deficit in the activity, function and response of the host. Alternatively, 
a therapeutically effective amount is sufficient to cause an improvement in a 
clinically significant condition in the host. 

The term "adjuvant" refers to a compound or mixture that enhances the immune 
response to an antigen. An adjuvant can serve as a tissue depot that slowly 
releases the antigen and also as a lymphoid system activator that non-specifically 
enhances the immune response (Hood et al., Immunology, Second Ed. y 1984, 
Benjamin/Cummings: Menlo Park, California, p. 384). Often, a primary 
challenge with an antigen alone, in the absence of an adjuvant, will fail to elicit a 
humoral or cellular immune response. Adjuvants include, but are not limited to, 
complete Freund's adjuvant, incomplete Freund's adjuvant, saponin, mineral gels 
such as aluminum hydroxide, surface active substances such as lysolecithin, 
pluronic polyols, polyanions, peptides, oil or hydrocarbon emulsions, keyhole 
limpet hemocyanins, dinitrophenol, and potentially useful human adjuvants such as 
BCG (bacille Calmette-Gueriri) and Corynebacterium parvum. Preferably, the 
adjuvant is pharmaceutical^ acceptable. 

In its primary aspect, the present invention is directed to the identification of 
materials that function as modulators of mammalian body weight. In particular, 
the invention concerns the isolation, purification and sequencing of certain nucleic 
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acids that correspond to the ob gene in both mice and humans, as well as the 
corresponding polypeptides expressed by these nucleic acids. The invention thus 
comprises the discovery of nucleic acids having the nucleotide sequences set forth 
in FIGURE 1 (SEQ ID NO:l) and FIGURE 2 (SEQ ID NO:3), and to degenerate 
variants, alleles and fragments thereof, all possessing the activity of modulating 
body weight and adiposity. The correspondence of the present nucleic acids to the 
ob gene portends their significant impact on conditions such as obesity as well as 
other maladies and dysfunctions where abnormalities in body weight are a 
contributory factor. The invention extends to the proteins expressed by the nucleic 
acids of the invention, and particularly to those proteins set forth in FIGURE 1 
(SEQ ID NO:2), FIGURE 3 (SEQ ID NO:4), FIGURE 5 (SEQ ID NO:5), and 
FIGURE 6 (SEQ ID NO:6), as well as conserved variants, active fragments, and 
cognate small molecules. 

In particular, the present invention contemplates that naturally occurring fragments 
of the ob polypeptide may be important. The peptide sequence includes a number 
of sites that are frequently the target for proteolytic cleavage, e.g. , arginine 
residues. It is possible that the full length polypeptide may be cleaved at one or 
more such sites to form biologically active fragments. Such biologically active 
fragments may either agonize or antagonize the functional activity of the ob 
polypeptide to reduce body weight. 

As discussed earlier, the weight control modulator peptides or their binding 
partners or other ligands or agents exhibiting either mimicry or antagonism to 
them or control over their production, may be prepared in pharmaceutical 
compositions, with a suitable carrier and at a strength effective for administration 
by various means to a patient experiencing abnormal fluctuations in body weight 
or adiposity, either alone or as part of an adverse medical condition such as cancer 
or AIDS, for the treatment thereof. A variety of administrative techniques may be 
utilized, among them parenteral techniques such as subcutaneous, intravenous and 
intraperitoneal injections, catheterizations and the like. Average quantities of the 
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recognition factors or their subunits may vary and in particular should be based 
upon the recommendations and prescription of a qualified physician or 
veterinarian. 

Also, antibodies including both polyclonal and monoclonal antibodies, and drugs 
that modulate the production or activity of the weight control modulators 
recognition factors and/or their subunits may possess certain diagnostic 
applications and may for example, be utilized for the purpose of detecting and/or 
measuring conditions where abnormalities in body weight are or may be likely to 
develop. For example, the modulator peptides or their active fragments may be 
used to produce both polyclonal and monoclonal antibodies to themselves in a 
variety of cellular media, by known techniques such as the hybridoma technique 
utilizing, for example, fused mouse spleen Lymphocytes and myeloma cells. These 
techniques are described in detail below. Likewise, small molecules that mimic or 
antagonize the activity(ies) of the receptor recognition factors of the invention may 
be discovered or synthesized, and may be used in diagnostic and/or therapeutic 
protocols. 

Panels of monoclonal antibodies produced against modulator peptides can be 
screened for various properties; i.e., isotype, epitope, affinity, etc. Of particular 
interest are monoclonal antibodies that neutralize the activity of the modulator 
peptides. Such monoclonals can be readily identified in activity assays for the 
weight modulators. High affinity antibodies are also useful when immunoaffinity 
purification of native or recombinant modulator is possible. 

Preferably, the anti-modulator antibody used in the diagnostic and therapeutic 
methods of this invention is an affinity purified polyclonal antibody. More 
preferably, the antibody is a monoclonal antibody (mAb). In addition, it is 
preferable for the anti-modulator antibody molecules used herein be in the form of 
Fab, Fab', F(ab') 2 or F(v) portions of whole antibody molecules. 
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As suggested earlier, a diagnostic method useful in the present invention comprises 
examining a cellular sample or medium by means of an assay including an 
effective amount of an antagonist to a modulator protein, such as an anti- 
modulator antibody, preferably an affinity-purified polyclonal antibody, and more 
5 preferably a mAb. In addition, it is preferable for the anti-modulator antibody 
molecules used herein be in the form of Fab, Fab', F(ab') 2 or F(v) portions or 
whole antibody molecules. As previously discussed, patients capable of benefiting 
from this method include those suffering from cancer, AIDS, obesity or other 
condition where abnormal body weight is a characteristic or factor. Methods for 
10 isolating the modulator and inducing anti-modulator antibodies and for determining 
and optimizing the ability of anti-modulator antibodies to assist in the examination 
of the target cells are all well-known in the art. 

The nucleic acids contemplated by the present invention extend as indicated, to 
other nucleic acids that code on expression for peptides such as those set forth in 

15 FIGURE 1 (SEQ ID NO:2), FIGURE 3 (SEQ ID NO:4), FIGURE 5 (SEQ ID 

NO:5), and FIGURE 6 (SEQ ID NO: 6) herein. Accordingly, while specific DNA 
has been isolated and sequenced in relation to the ob gene, any animal cell 
potentially can serve as the nucleic acid source for the molecular cloning of a gene 
encoding the peptides of the invention. The DNA may be obtained by standard 

20 procedures known in the art from cloned DNA (e.g. , a DNA "library"), by 
chemical synthesis, by cDNA cloning, or by the cloning of genomic DNA, or 
fragments thereof, purified from the desired cell (See, for example, Sambrook et 
al., 1989, supra; Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, 
MRL Press, Ltd., Oxford, U.K. Vol. I, II). Clones derived from genomic DNA 

25 may contain regulatory and intron DNA regions in addition to coding regions; 
clones derived from cDNA will not contain intron sequences. Whatever the 
source, the gene should be molecularly cloned into a suitable vector for 
propagation of the gene. 
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In the molecular cloning of the gene from genomic DNA, the genomic DNA can 
be amplified using primers selected from the cDNA sequences. Alternatively, 
DNA fragments are generated, some of which will encode the desired gene. The 
DNA may be cleaved at specific sites using various restriction enzymes. One may 
also use DNAse in the presence of manganese to fragment the DNA, or the DNA 
can be physically sheared, as for example, by sonication. The linear DNA 
fragments can then be separated according to size by standard techniques, 
including but not limited to, agarose and polyacrylamide gel electrophoresis and 
column chromatography. 

Once the DNA fragments are generated, identification of the specific DNA 
fragment containing the desired ob or oft-like gene may be accomplished in a 
number of ways. For example, if an amount of a portion of a ob or ob-likc gene 
or its specific RNA, or a fragment thereof, is available and can be purified and 
labeled, the generated DNA fragments may be screened by nucleic acid 
hybridization to the labeled probe (Benton and Davis, 1977, Science 196:180; 
Grunstein and Hogness, 1975, Proc. Natl. Acad. Sci. U.S.A. 72:3961). The 
present invention provides such nucleic acid probes, which can be conveniently 
prepared from the specific sequences disclosed herein, e.g., a hybridizable probe 
having a nucleotide sequence corresponding to at least a 10, and preferably a 15, 
nucleotide fragment of the sequences depicted in Figure 1 (SEQ ID NO:l) or 
Figure 2 (SEQ ID NO:3). Preferably, a fragment is selected that is highly unique 
to the modulator peptides of the invention. Those DNA fragments with substantial 
homology to the probe will hybridize. As noted above, the greater the degree of 
homology, the more stringent hybridization conditions can be used. In one 
embodiment, low stringency hybridization conditions are used to identify a 
homologous modulator peptide. However, in a preferred aspect, and as 
demonstrated experimentally herein, a nucleic acid encoding a modulator peptide 
of the invention will hybridize to a nucleic acid having a nucleotide sequence such 
as depicted in Figure 1 (SEQ ID NO:l) or Figure 2 (SEQ ID NO:3), or a 



; 
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hybridizable fragment thereof, under moderately stringent conditions; more 
j preferably, it will hybridize under high stringency conditions. 

Alternatively, the presence of the gene may be detected by assays based on the 
physical, chemical, or immunological properties of its expressed product. For 
5 example, cDNA clones, or DNA clones which hybrid-select the proper mRNAs, 
can be selected which produce a protein that, e.g. , has similar or identical 
electrophoretic migration, isoelectric focusing behavior, proteolytic digestion 
maps, tyrosine phosphatase activity or antigenic properties as known for the 
present modulator peptides. For example, the antibodies of the instant invention 
10 can conveniently be used to screen for homologs of modulator peptides from other 
sources. 

A gene encoding a modulator peptide of the invention can also be identified by 

mRNA selection, i.e., by nucleic acid hybridization followed by in 

vitro translation. In this procedure, fragments are used to isolate complementary 

15 mRNAs by hybridization. Such DNA fragments may represent available, purified 
modulator DNA. Immunoprecipitation analysis or functional assays (e.g., tyrosine 
phosphatase activity) of the in vitro translation products of the products of the 
isolated mRNAs identifies the mRNA and, therefore, the complementary DNA 
fragments, that contain the desired sequences. In addition, specific mRNAs may 

20 be selected by adsorption of polysomes isolated from cells to immobilized 
antibodies specifically directed against a modulator peptide. 

A radiolabeled modulator peptide cDNA can be synthesized using the selected 
mRNA (from the adsorbed polysomes) as a template. The radiolabeled mRNA or 
cDNA may then be used as a probe to identify homologous modulator peptide 
25 DNA fragments from among other genomic DNA fragments. 

Another feature of this invention is the expression of the DNA sequences disclosed 
herein. As is well known in the art, DNA sequences may be expressed by 
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operatively linking them to an expression control sequence in an appropriate 
expression vector and employing that expression vector to transform an 
appropriate unicellular host. 

Such operative linking of a DNA sequence of this invention to an expression 
control sequence, of course, includes, if not already part of the DNA sequence, 
the provision of an initiation codon, ATG, in the correct reading frame upstream 
of the DNA sequence. 

A wide variety of host/expression vector combinations may be employed in 
expressing the DNA sequences of this invention. Useful expression vectors, for 
example, may consist of segments of chromosomal, non-chromosomal and 
Synthetic DNA sequences. Suitable vectors include derivatives of SV40 and 
known bacterial plasmids, e.g., E. coli plasmids col El, pCRl, pBR322, pMB9, 
pUC or pUC plasmid derivatives, e.g., pGEX vectors, pET vectors, pmal-c, 
pFLAG, etc., and their derivatives, plasmids such as RP4; phage DNAs, e.g., the 
numerous derivatives of phage A, e.g., NM989, and other phage DNA, e.g., M13 
and Filamentous single stranded phage DNA; yeast plasmids such as the 2/x 
plasmid or derivatives thereof; vectors useful in eukaryotic cells, such as vectors 
useful in insect or mammalian cells; vectors derived from combinations of 
plasmids and phage DNAs, such as plasmids that have been modified to employ 
phage DNA or other expression control sequences; and the like. In a preferred 
embodiment, expression of ob is achieved in methylotrophic yeast, e.g., Pichia 
pastoris yeast {see, e.g., International Patent Publication No. WO 90/03431, 
published 5 April 1990, by Brierley et al.; International Patent Publication No. 
WO 90/10697, published 20 September 1990, by Siegel et al.). In a specific 
embodiment, infra, an expression vector is engineered for expression of ob under 
control of the cx-mating factor signal sequence. 

Any of a wide variety of expression control sequences — sequences that control the 
expression of a DNA sequence operatively linked to it — may be used in these 
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vectors to express the DNA sequences of this invention. Such useful expression 
control sequences include, for example, the early or late promoters of SV40, 
CMV, vaccinia, polyoma or adenovirus, the lac system, the trp system, the TAC 
system, the TRC system, the LTR system, the major operator and promoter regions 
5 of phage X, the control regions of fd coat protein, the promoter for 

3 -phosphogly cerate kinase or other glycolytic enzymes, the promoters of acid 
phosphatase (e.g., Pho5), the AOX 1 promoter of methylotrophic yeast, the 
promoters of the yeast a-mating factors, and other sequences known to control the 
expression of genes of prokaryotic or eukaryotic cells or their viruses, and various 
10 combinations thereof. 

A wide variety of unicellular host cells are also useful in expressing the DNA 
sequences of this invention. These hosts may include well known eukaryotic and 
prokaryotic hosts, such as strains of E. coli, Pseudomonas, Bacillus, Streptomyces; 
fungi such as yeasts (Saccharomyces , and methylotrophic yeast such as Pichia, 
Candida, Hansenula, and Torulopsis); and animal cells, such as CHO, Rl.l, B-W 
and L-M cells, African Green Monkey kidney cells (e.g., COS 1, COS 7, BSC1, 
BSC40, and BMT10), insect cells (e.g., Sf9), and human cells and plant cells in 
tissue culture. 

It will be understood that not all vectors, expression control sequences and hosts 
20 will function equally well to express the DNA sequences of this invention. 
Neither will all hosts function equally well with the same expression system. 
However, one skilled in the art will be able to select the proper vectors, 
expression control sequences, and hosts without undue experimentation to 
accomplish the desired expression without departing from the scope of this 
25 invention. For example, in selecting a vector, the host must be considered 

because the vector must function in it. The vector's copy number, the ability to 
control that copy number, and the expression of any other proteins encoded by the 
vector, such as antibiotic markers, will also be considered. 
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In selecting an expression control sequence, a variety of factors will normally be 
considered. These include, for example, the relative strength of the system, its 
controllability, and its compatibility with the particular DNA sequence or gene to 
be expressed, particularly as regards potential secondary structures. Suitable 
unicellular hosts will be selected by consideration of, e.g., their compatibility with 
the chosen vector, their secretion characteristics, their ability to fold proteins 
correctly, and their fermentation requirements, as well as the toxicity to the host 
of the product encoded by the DNA sequences to be expressed, and the ease of 
purification of the expression products. 

Considering these and other factors a person skilled in the art will be able to 
construct a variety of vector/expression control sequence/host combinations that 
will express the DNA sequences of this invention on fermentation or in large scale 
animal culture. 

In a specific embodiment, an ob fusion protein can be expressed. An ob fusion 
protein comprises at least a functionally active portion of a non-ob protein joined 
via a peptide bond to at least a functionally active portion of an ob polypeptide. 
The non-ob sequences can be amino- or carboxy-terminal to the ob sequences. 
More preferably, for stable expression of a proteolytically inactiveob fusion 
protein, the portion of the non-ob fusion protein is joined via a peptide bond to the 
amino terminus of the ob protein. A recombinant DNA molecule encoding such a 
fusion protein comprises a sequence encoding at least a functionally active portion 
of a non-ob protein joined in-frame to the ob coding sequence, and preferably 
encodes a cleavage site for a specific protease, e.g., thrombin or Factor Xa, 
preferably at the ob-non-ob juncture. In a specific embodiment, the fusion protein 
is expressed in Escherichia coli or in P. pastoris. 

In a specific embodiment, infra, vectors were prepared to express the murine and 
human ob genes, with and without the codon for gln-49, in bacterial expression 
systems and yeast (Pichia) expression systems as fusion proteins. The ob gene is 
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prepared with an endonuclease cleavage site, e.g., using PCR and novel primers. 
It is desirable to confirm sequences generated by PCR, since the probability of 
including a point mutation is greater with this technique. A plasmid containing a 
histidine tag (HIS-TAG) and a proteolytic cleavage site is used. The presence of 
the histidine makes possible the selective isolation of recombinant proteins on a 
Ni-chelation column, or by affinity purification. The proteolytic cleavage site, in 
a specific embodiment, infra, a thrombin cleavage site, is engineered so that 
treatment with the protease, e.g. , thrombin, will release the full length mature 
{i.e., lacking a signal sequence) ob polypeptide. 

In another aspect, the pGEX vector (Smith and Johnson, 1988, Gene 67:31-40) 
can be used. This vector fuses the schistosoma japonicum glutathionine S- 
transferase cDNA to the sequence of interest. Bacterial proteins are harvested and 
recombinant proteins can be quickly purified on a reduced glutathione affinity 
column. The GST carrier can subsequently be cleaved from fusion proteins by 
digestion with site-specific proteases. After cleavage, the carrier and uncleaved 
fusion protein can be removed by absorption on glutathione agarose. Difficulty 
with the system occasionally arises when the encoded protein is insoluble in 
aqueous solutions. 

Expression of recombinant proteins in bacterial systems may result in incorrect 
folding of the expressed protein, requiring refolding. The recombinant protein can 
be refolded prior to or after cleavage to form a functionally active ob polypeptide. 
The ob polypeptide may be refolded by the steps of (i) incubating the protein in a 
denaturing buffer that contains a reducing agent, and then (ii) incubating the 
protein in a buffer that contains an oxidizing agent, and preferably also contains a 
protein stabilizing agent or a chaotropic agent, or both. Suitable redox 
(reducing/oxidizing) agent pairs include, but are not limited to, reduced 
glutathione/glutathione disulfide, cystine/cysteine, cystamine/cysteamine, and 2- 
mercaptoethanol/2-hydroxyethyldisulfide. In a particular aspect, the fusion protein 
can be solubilized in a denaturarit, such as urea, prior to exchange into the 
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reducing buffer. In preferred embodiment, the protein is also purified, e.g., by 
ion exchange or Ni-chelation chromatography, prior to exchange into the reducing 
buffer. Denaturing agents include but are not limited to urea and guanidine-HCl. 
The recombinant protein is then diluted about at least 10-fold, more preferably 
5 about 100-fold, into an oxidizing buffer that contains an oxidizing agent, such as 
but not limited to 0.1 M Tris-HCl, pH 8.0, 1 mM EDTA, 0.15 M NaCl, 0.3 M 
oxidized glutathione. The fusion protein is then incubated for about 1 to about 24 
hours, preferably about 2 to about 16 hours, at room temperature in the oxidizing 
buffer. The oxidizing buffer may comprise a protein stabilizing agent, e.g., a 

10 sugar, an alcohol, or ammonium sulfate. The oxidizing buffer may further 
comprises a chaotropic agent at low concentration, to destabilize incorrect 
intermolecular interactions and thus promote proper folding. Suitable chaotropic 
agents include but are not limited to a detergent, a polyol, L-arginine, guanidine- 
HCl and polyethylene glycol (PEG). It is important to use a low enough 

15 concentration of the chaotropic agent to avoid denaturing the protein. The 

refolded protein can be concentrated by at least about 10-fold, more preferably by 
the amount it was diluted into the oxidizing buffer. 

Bacterial fermentation processes can also result in a recombinant protein 
preparation that contains unacceptable levels of endotoxins. Therefore, the 
20 invention contemplates removal of such endotoxins, e.g. , by using endotoxin- 
specific antibodies or other endotoxin binding molecules. The presence of 
endotoxins can be determined by standard techniques, such as by employing E- 
TOXATE Reagents (Sigma), or with bioassays. 

In addition to the specific example, the present inventors contemplate use of 
25 baculovirus, mammalian, and yeast expression systems to express the ob protein. 
For example, in baculovirus expression systems, both non-fusion transfer vectors, 
such as but not limited to pVL941 (BamHl cloning site; Summers), pVL1393 
(BamHl, Smal, Xbal, EcoRl, NotI, Xmalll, Bglll, and PstI cloning site; 
Invitrogen), pVL1392 (Bglll, PstI, NotI, Xmalll, EcoRI, Xbal, Smal, and BamHl 
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cloning site; Summers and Invitrogen), and pBlueBacIII (BamHl, Bglll, PstI, 
Ncol, and HindlH cloning site, with blue/white recombinant screening possible; 
Invitrogen), and fusion transfer vectors, such as but not limited to pAc700 
(BamHl and Kpnl cloning site, in which the BamHl recognition site begins with 
5 the initiation codon; Summers), pAc701 and pAc702 (same as pAc700, with 

different reading frames), pAc360 (BamHl cloning site 36 base pairs downstream 
of a polyhedrin initiation codon; Invitrogen(195)), and pBlueBacHisA; B, C (three 
different reading frames, with BamHl, Bglll, PstI, Ncol, and HindHI cloning site, 
an N-terminal peptide for ProBond purification, and blue/white recombinant 
10 screening of plaques; Invitrogen (220)). 

O Mammalian expression vectors contemplated for use in the invention include 

. ^~ 

— vectors with inducible promoters, such as dihydrofolate reductase (DHFR), e.g., 

^ any expression vector with a DHFR expression vector, or a DHFR/methotrexate 

Si co-amplification vector, such as pED (PstI, Sail, Sbal, Smal, and EcoRI cloning 

^] 15 site, with the vector expressing both the cloned gene and DHFR; see Kaufman, 

s E Current Protocols in Molecular Biology, 16.12, 1991). Alternatively, a glutamine 

Q synthetase/methionine sulfoximine co-amplification vector, such as pEE14 

(HindHI, Xbal, Smal, Smal, EcoRI, and Bell cloning site, in which the vector 
y expresses glutamine synthase and the cloned gene; Celltech). In another 

20 embodiment, a vector that directs episomal expression under control of Epstein 
Barr Virus (EBV) can be used, such as pREP4 (BamHl, Sfil, Xhol, NotI, Nhel, 
HindHI, Nhel, PvuII, and Kpnl cloning site, constitutive RSV LTR promoter, 
hygromycin selectable marker; Invitrogen), pCEP4 (BamHl, Sfil, Xhol, NotI, 
Nhel, HindHI, Nhel, PvuII, and Kpnl cloning site, constitutive hCMV immediate 
25 early gene, hygromycin selectable marker; Invitrogen), pMEP4 (Kpnl, Pvul, 

Nhel, HindHI, NotI, Xhol, Sfil, BamHl cloning site, inducible methallothionein 
Ha gene promoter, hygromycin selectable marker: Invitrogen), pREP8 (BamHl, 
Xhol, NotI, HindHI, Nhel, and Kpnl cloning site, RSV LTR promoter, histidinol 
selectable marker; Invitrogen), pREP9 (Kpnl, Nhel, HindHI, NotI, Xhol, Sfil, and 
30 BamHl cloning site, RSV LTR promoter, G418 selectable marker; Invitrogen), 
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and pEBVHis (RSV LTR promoter, hygromycin selectable marker, N-terminal 
peptide purifiable via ProBond resin and cleaved by enterokinase; Invitrogen). 
Selectable mammalian expression vectors for use in the invention include 
pRc/CMV (Hindlll, BstXI, NotI, Sbal, and Apal cloning site, G418 selection; 
Invitrogen), pRc/RSV (Hindlll, Spel, BstXI, NotI, Xbal cloning site, G418 
selection; Invitrogen), and others. Vaccinia virus mammalian expression vectors 
(see, Kaufman, supra) for use according to the invention include but are not 
limited to pSCll (Smal cloning site, TK- and /?-gal selection), pMJ601 (Sail, 
Smal, Afll, Narl, BspMII, BamHI, Apal, Nhel, SacII, Kpnl, and Hindlll cloning 
site; TK- and /?-gal selection), and pTKgptFIS (EcoRI, PstI, Sail, AccI, Hindll, 
Sbal, BamHI, and HpA cloning site, TK or XPRT selection). 

Yeast expression systems can also be used according to the invention to express ob 
polypeptide. For example, the non-fusion pYES2 vector (Xbal, SphI, Shol, NotI, 
GstXI, EcoRI, BstXI, BamHI, SacI, Kpnl, and Hindlll cloning sit; Invitrogen) or 
the fusion pYESHisA, B, C (Xbal, SphI, Shol, NotI, BstXI, EcoRI, BamHI, 
SacI, Kpnl, and Hindlll cloning site, N-terminal peptide purified with ProBond 
resin and cleaved with enterokinase; Invitrogen), to mention just two, can be 
employed according to the invention. 

It is further intended that body weight modulator peptide analogs may be prepared 
from nucleotide sequences derived within the scope of the present invention. 
Analogs, such as fragments, may be produced, for example, by pepsin digestion of 
weight modulator peptide material. Other analogs, such as muteins, can be 
produced by standard site-directed mutagenesis of weight modulator peptide coding 
sequences. Analogs exhibiting "weight modulator activity" such as small 
molecules, whether functioning as promoters or inhibitors, may be identified by 
known in vivo and/or in vitro assays. 

In addition to recombinant expression of ob polypeptide, the present invention 
envisions and fully enables preparation of ob polypeptide, or fragments thereof, 
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using the well known and highly developed techniques of solid phase peptide 
synthesis. The invention contemplates using both the popular Boc and Fmoc, as 
well as other protecting group strategies, for preparing ob polypeptide or 
fragments thereof. Various techniques for refolding and oxidizing the cysteine 
side chains to form a disulfide bond are also well known in the art. 

As mentioned above, a DNA sequence encoding weight modulator peptides as 
disclosed herein can be prepared synthetically rather than cloned. The DNA 
sequence can be designed with the appropriate codons for the weight modulator 
peptide amino acid sequences. In general, one will select preferred codons for the 
intended host if the sequence will be used for expression. The complete sequence 
is assembled from overlapping oligonucleotides prepared by standard methods and 
assembled into a complete coding sequence. See, e.g., Edge, Nature, 292:756 
(1981); Nambair et al., Science, 223:1299 (1984); Jay et al., /. Biol. Chem., 
259:6311 (1984). 

Synthetic DNA sequences allow convenient construction of genes which will 
express weight modulator analogs or "muteins". Alternatively, DNA encoding 
muteins can be made by site-directed mutagenesis of native modulator genes or 
cDNAs, and muteins can be made directly using conventional polypeptide 
synthesis. 

A general method for site-specific incorporation of unnatural amino acids into 
proteins is described in Christopher J. Noren, Spencer J. Anthony-Cahill, Michael 
C. Griffith, Peter G. Schultz, Science, 244:182-188 (April 1989). This method 
may be used to create analogs of the ob polypeptide with unnatural amino acids. 

The present invention extends to the preparation of antisense nucleotides and 
ribozymes that may be used to interfere with the expression of the weight 
modulator proteins at the translational level. This approach utilizes antisense 
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nucleic acid and ribozymes to block translation of a specific mRNA, either by 
masking that mRNA with an antisense nucleic acid or cleaving it with a ribozyme. 



Antisense nucleic acids are DNA or RNA molecules that are complementary to at 
least a portion of a specific mRNA molecule (See Weintraub, 1990; 
5 Marcus-Sekura, 1988). In the cell, they hybridize to that mRNA, forming a 
double stranded molecule. The cell does not translate an mRNA in this 
double-stranded form. Therefore, antisense nucleic acids interfere with the 
expression of mRNA into protein. Oligomers of about fifteen nucleotides and 
molecules that hybridize to the AUG initiation codon will be particularly efficient, 
10 since they are easy to synthesize and are likely to pose fewer problems than larger 
molecules when introducing them into weight modulator peptide-producing cells. 
Antisense methods have been used to inhibit the expression of many genes in vitro 
05 (Marcus-Sekura, 1988; Hambor et al., 1988). 



Ribozymes are RNA. molecules possessing the ability to specifically cleave other 
15 single stranded RNA molecules in a manner somewhat analogous to DNA 

restriction endonucleases. Ribozymes were discovered from the observation that 
certain mRNAs have the ability to excise their own introns. By modifying the 
nucleotide sequence of these RNAs, researchers have been able to engineer 
molecules that recognize specific nucleotide sequences in an RNA molecule and 
20 cleave it (Cech, 1988). Because they are sequence-specific, only mRNAs with 
particular sequences are inactivated. 



Investigators have identified two types of ribozymes, Tetrahymena-typz and 
"hammerhead"-type (Hasselhoff and Gerlach, 1988). Tetrahymena-typz ribozymes 
recognize four-base sequences, while "hammerhead"-type recognize eleven- to 
25 eighteen-base sequences. The longer the recognition sequence, the more likely it 
is to occur exclusively in the target MRNA species. Therefore, hammerhead-type 
ribozymes are preferable to Tetrahymena-typt ribozymes for inactivating a specific 
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mRNA species, and eighteen base recognition sequences are preferable to shorter 
recognition sequences. 

The DNA sequences described herein may thus be used to prepare antisense 
molecules against and ribozymes that cleave mRNAs for weight modulator 
5 proteins and their ligands, thus inhibiting expression of the ob gene, and leading to 
increased weight gain and adiposity. 

The present invention also relates to a variety of diagnostic applications, including 
methods for detecting the presence of conditions and/or stimuli that impact 
abnormalities in body weight or adiposity, by reference to their ability to elicit the 
10 activities which are mediated by the present weight modulators. As mentioned 
earlier, the weight modulator peptides can be used to produce antibodies to 
themselves by a variety of known techniques, and such antibodies could then be 
isolated and utilized as in tests for the presence of particular transcriptional activity 
in suspect target cells. 

15 Antibody(ies) to the body weight modulators, i.e., the ob polypeptide, can be 
produced and isolated by standard methods including the well known hybridoma 
techniques. For convenience, the antibody(ies) to the weight modulators will be 
referred to herein as Ab t and antibody(ies) raised in another species as Ab 2 . 

According to the invention, ob polypeptide produced recombinantly or by chemical 
20 synthesis, and fragments or other derivatives or analogs thereof, including fusion 
proteins, may be used as an immunogen to generate antibodies that recognize the 
ob polypeptide. Such antibodies include but are not limited to polyclonal, 
monoclonal, chimeric, single chain, Fab fragments, and an Fab expression library. 



25 



Various procedures known in the art may be used for the production of polyclonal 
antibodies to ob polypeptide a recombinant PTP or derivative or analog thereof. 
For the production of antibody, various host animals can be immunized by 
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injection with the ob polypeptide, or a derivative (e.g., fragment or fusion protein) 
thereof, including but not limited to rabbits, mice, rats, sheep, goats, etc. In one 
embodiment, the ob polypeptide or fragment thereof can be conjugated to an 
immunogenic carrier, e.g., bovine serum albumin (BSA) or keyhole limpet 
hemocyanin (KLH). Various adjuvants may be used to increase the immunological 
response, depending on the host species, including but not limited to Freund's 
(complete and incomplete), mineral gels such as aluminum hydroxide, surface 
active substances such as lysplecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful 
human adjuvants such as BCG (bacille Calmette-Gueriri) and Corynebacterium 
parvum. 

For preparation of monoclonal antibodies directed toward the ob polypeptide, or 
fragment, analog, or derivative thereof, any technique that provides for the 
production of antibody molecules by continuous cell lines in culture may be used. 
These include but are not limited to the hybridoma technique originally developed 
by Kohler and Milstein (1975, Nature 256:495-497), as well as the trioma 
technique, the human B-cell hybridoma technique (Kozbor et al., 1983, 
Immunology Today 4:72), and the EBV-hybridoma technique to produce human 
monoclonal antibodies (Cole et ah, 1985, in Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). In an additional embodiment of the 
invention, monoclonal antibodies can be produced in germ-free animals utilizing 
recent technology (PCT/US90/02545). According to the invention, human 
antibodies may be used and can be obtained by using human hybridomas (Cote et 
al., 1983, Proc. Nad. Acad. Sci. U.S.A. 80:2026-2030) or by transforming human 
B cells with EBV virus in vitro (Cole et al., 1985, in Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, pp. 77-96). In fact, according to the invention, 
techniques developed for the production of "chimeric antibodies" (Morrison et al., 
1984, J. Bacteriol. 159-870; Neuberger et al., 1984, Nature 312:604-608; Takeda 
et al., 1985, Nature 314:452-454) by splicing the genes from a mouse antibody 
molecule specific for an ob polypeptide together with genes from a human 
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antibody molecule of appropriate biological activity can be used; such antibodies 
are within the scope of this invention. Such human or humanized chimeric 
antibodies are preferred for use in therapy of human diseases or disorders 
(described infra), since the human or humanized antibodies are much less likely 
5 than xenogenic antibodies to induce an immune response, in particular an allergic 
response, themselves. 

According to the invention, techniques described for the production of single chain 
antibodies (U.S. Patent 4,946,778) can be adapted to produce ob polypep tide- 
specific single chain antibodies. An additional embodiment of the invention 
10 utilizes the techniques described for the construction of Fab expression libraries 

□ (Huse et ah, 1989, Science 246:1275-1281) to allow rapid and easy identification 

P of monoclonal Fab fragments with the desired specificity for an ob polypeptide, or 

jp its derivatives, or analogs. 

01 

?! Antibody fragments which contain the idiotype of the antibody molecule can be 

5 15 generated by known techniques. For example, such fragments include but are not 

p limited to: the F(ab') 2 fragment which can be produced by pepsin digestion of the 

antibody molecule; the Fab' fragments which can be generated by reducing the 
disulfide bridges of the F(ab') 2 fragment, and the Fab fragments which can be 
generated by treating the antibody molecule with papain and a reducing agent. 

In the production of antibodies, screening for the desired antibody can be 
accomplished by techniques known in the art, e.g., radioimmunoassay, ELISA 
(enzyme-linked immunosorbent assay), "sandwich" immunoassays, 
immunoradiometric assays, gel diffusion precipitin reactions, immunodiffusion 
assays, in situ immunoassays (using colloidal gold, enzyme or radioisotope labels, 
for example), western blots, precipitation reactions, agglutination assays (e.g., gel 
agglutination assays, hemagglutination assays), complement fixation assays, 
immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, 
etc. In one embodiment, antibody binding is detected by detecting a label on the 
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primary antibody. In another embodiment, the primary antibody is detected by 
detecting binding of a secondary antibody or reagent to the primary antibody. In a 
further embodiment, the secondary antibody is labeled. Many means are known in 
the art for detecting binding in an immunoassay and are within the scope of the 
present invention. For example, to select antibodies which recognize a specific 
epitope of an ob polypeptide, one may assay generated hybridomas for a product 
which binds to an ob polypeptide fragment containing such epitope. For selection 
of an antibody specific to an ob polypeptide from a particular species of animal, 
one can select on the basis of positive binding with ob polypeptide expressed by or 
isolated from cells of that species of animal. 

The foregoing antibodies can be used in methods known in the art relating to the 
localization and activity of the ob polypeptide, e.g., for Western blotting, imaging 
ob polypeptide in situ, measuring levels thereof in appropriate physiological 
samples, etc. 

In a specific embodiment, antibodies that agonize or antagonize the activity of ob 
polypeptide can be generated. Such antibodies can be tested using the assays 
described infra for identifying ligands. 

Immortal, antibody-producing cell lines can also be created by techniques other 
than fusion, such as direct transformation of B lymphocytes with oncogenic DNA, 
or transfection with Epstein-Barr virus. See, e.g., M. Schreier et al., "Hybridoma 
Techniques" (1980); Hammerling et al., "Monoclonal Antibodies And T-cell 
Hybridomas" (1981); Kennett et al., "Monoclonal Antibodies" (1980); see also 
U.S. Patent Nos. 4,341,761; 4,399,121; 4,427,783; 4,444,887; 4,451,570; 
4,466,917; 4,472,500; 4,491,632; 4,493,890. 

In a specific embodiment, antibodies are developed by immunizing rabbits with 
synthetic peptides predicted by the protein sequence or with recombinant proteins 
made using bacterial expression vectors. The choice of synthetic peptides is made 
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after careful analysis of the predicted protein structure, as described above. In 
particular, peptide sequences between putative cleavage sites are chosen. Synthetic 
peptides are conjugated to a carrier such as KLH hemocyanin or BSA using 
carbodiimide and used in Freunds adjuvant to immunize rabbits. In order to 
prepare recombinant protein, the gex vector can be used to express the polypeptide 
(Smith and Johnson, supra). Alternatively, one can use only hydrophilic domains 
to generate the fusion protein. The expressed protein will be prepared in quantity 
and used to immunize rabbits in Freunds adjuvant. 

The presence of weight modulator in cells can be ascertained by the usual 
immunological procedures applicable to such determinations. A number of useful 
procedures are known. Three such procedures which are especially useful utilize 
either the receptor recognition factor labeled with a detectable label, antibody Ab A 
labeled with a detectable label, or antibody Ab 2 labeled with a detectable label. 
The procedures may be summarized by the following equations wherein the 
asterisk indicates that the particle is labeled, and "WM" stands for the weight 
modulator: 

A. WM* + Ab x = WM*Ab x 

B. WM + Ab* = WMAbi* 

C. WM + Ab x + Ab 2 * = Ab x WMAb 2 * 

The procedures and their application are all familiar to those skilled in the art and 
accordingly may be utilized within the scope of the present invention. The 
"competitive" procedure, Procedure A, is described in U.S. Patent Nos. 3,654,090 
and 3,850,752. Procedure B is representative of the well known competitive assay 
techniques. Procedure C, the "sandwich" procedure, is described in U.S. Patent 
Nos. RE 31,006 and 4,016,043. Still other procedures are known such as the 
"double antibody", or "DASP" procedure. 

In each instance, the weight modulators form complexes with one or more 
antibody(ies) or binding partners and one member of the complex is labeled with a 
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detectable label. The fact that a complex has formed and, if desired, the amount 
thereof, can be determined by known methods applicable to the detection of labels. 

It will be seen from the above, that a characteristic property of Ab^ is that it will 
react with Ab^ This is because Ab x raised in one mammalian species has been 
5 used in another species as an antigen to raise the antibody Ab 2 . For example, Ab 2 
may be raised in goats using rabbit antibodies as antigens. Ab 2 therefore would be 
anti-rabbit antibody raised in goats. For purposes of this description and claims, 
Ab x will be referred to as a primary or anti-weight modulator antibody, and Ab 2 
will be referred to as a secondary or anti-A^ antibody. 

10 The labels most commonly employed for these studies are radioactive elements, 
enzymes, chemicals which fluoresce when exposed to ultraviolet light, and others. 

A number of fluorescent materials are known and can be utilized as labels. These 
include, for example, fluorescein, rhodamine and auramine. A particular detecting 
material is anti-rabbit antibody prepared in goats and conjugated with fluorescein 
15 through an isothiocyanate. 

The weight modulators or their binding partners can also be labeled with a 
radioactive element or with an enzyme. The radioactive label can be detected by 
any of the currently available counting procedures. The preferred isotope may be 
selected from 3 H, 14 C, 32 P, 35 S, 36 C1, 51 Cr, 57 Co, 58 Co, 59 Fe, 90 Y, 125 I, 131 I, and 
20 186 Re. 

Enzyme labels are likewise useful, and can be detected by any of the presently 
utilized colorimetric, spectrophotometric, fluorospectrophotometric, amperometric 
or gasometric techniques. The enzyme is conjugated to the selected particle by 
reaction with bridging molecules such as carbodiimides, diisocyanates, 
25 glutaraldehyde and the like. Many enzymes which can be used in these 
procedures are known and can be utilized. The preferred are peroxidase, 
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li-glucuronidase, ii-D-glucosidase, B-D-galactosidase, urease, glucose oxidase plus 
peroxidase and alkaline phosphatase. U.S. Patent Nos. 3,654,090; 3,850,752; and 
4,016,043 are referred to by way of example for their disclosure of alternate 
labeling material and methods. 

5 A particular assay system that is to be utilized in accordance with the present 
invention, is known as a receptor assay. In a receptor assay, the material to be 
assayed is appropriately labeled and then certain cellular test colonies are 
inoculated with a quantity of both the labeled and unlabeled material after which 
binding studies are conducted to determine the extent to which the labeled material 
10 binds to the cell receptors. In this way, differences in affinity between materials 
can be ascertained. 

Accordingly, a purified quantity of the weight modulator may be radiolabeled and 
combined, for example, with antibodies or other inhibitors thereto, after which 
binding studies would be carried out. Solutions would then be prepared that 

15 contain various quantities of labeled and unlabeled uncombined weight modulator, 
and cell samples would then be inoculated and thereafter incubated. The resulting 
cell monolayers are then washed, solubilized and then counted in a gamma counter 
for a length of time sufficient to yield a standard error of < 5 % . These data are 
then subjected to Scatchard analysis after which observations and conclusions 

20 regarding material activity can be drawn. While the foregoing is exemplary, it 
illustrates the manner in which a receptor assay may be performed and utilized, in 
the instance where the cellular binding ability of the assayed material may serve as 
a distinguishing characteristic. In turn, a receptor assay will be particularly useful 
in the identification of the specific receptors to the present modulators, such as the 

25 db receptor. 



A further assay useful and contemplated in accordance with the present invention 
is known as a "cis/trans" assay. Briefly, this assay employs two genetic 
constructs, one of which is typically a plasmid that continually expresses a 



03 
m 



48 

particular receptor of interest when transfected into an appropriate cell line, and 
the second of which is a plasmid that expresses a reporter such as luciferase, 
under the control of a receptor/Iigand complex. Thus, for example, if it is desired 
to evaluate a compound as a ligand for a particular receptor, one of the plasmids 
5 would be a construct that results in expression of the receptor in the chosen cell 
line, while the second plasmid would possess a promoter linked to the luciferase 
gene in which the response element to the particular receptor is inserted. If the 
compound under test is an agonist for the receptor, the ligand will complex with 
the receptor, and the resulting complex will bind the response element and initiate 
10 transcription of the luciferase gene. The resulting chemiluminescence is then 

measured photometrically, and dose response curves are obtained and compared to 
those of known ligands. The foregoing protocol is described in detail in U.S. 
Patent No. 4,981,784 and PCT International Publication No. WO 88/03168, for 
which purpose the artisan is referred. 

15 In a further embodiment of this invention, commercial test kits suitable for use by 
a medical specialist may be prepared to determine the presence or absence of 
predetermined transcriptional activity or predetermined transcriptional activity 
capability in suspected target cells. In accordance with the testing techniques 
discussed above, one class of such kits will contain at least the labeled weight 

20 modulator or its binding partner, for instance an antibody specific thereto, and 
directions, of course, depending upon the method selected, e.g., "competitive", 
"sandwich", "DASP" and the like. The kits may also contain peripheral reagents 
such as buffers, stabilizers, etc. 

Accordingly, a test kit may be prepared for the demonstration of the presence or 
25 capability of cells for predetermined transcriptional activity, comprising: 

(a) a predetermined amount of at least one labeled immunochemically reactive 
component obtained by the direct or indirect attachment of the present weight 
modulator or a specific binding partner thereto, to a detectable label; 

(b) other reagents; and 
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(c) directions for use of said kit. 

More specifically, the diagnostic test kit may comprise: 

(a) a known amount of the weight modulator as described above (or a binding 
partner) generally bound to a solid phase to form an immunosorbent, or in the 
alternative, bound to a suitable tag, or plural such end products, etc. (or their 
binding partners) one of each; 

(b) if necessary, other reagents; and 

(c) directions for use of said test kit. 

In a further variation, the test kit may be prepared and used for the purposes stated 
above, which operates according to a predetermined protocol (e.g. "competitive", 
"sandwich", "double antibody", etc.), and comprises: 

(a) a labeled component which has been obtained by coupling the weight 
modulator to a detectable label; 

(b) one or more additional immunochemical reagents of which at least one 
reagent is a ligand or an immobilized ligand, which ligand is selected from the 
group consisting of: 

(i) a ligand capable of binding with the labeled component (a); 
* (ii) a ligand capable of binding with a binding partner of the labeled 

component (a); 

(iii) a ligand capable of binding with at least one of the component(s) to 
be determined; and 

(iv) a ligand capable of binding with at least one of the binding partners 
of at least one of the component(s) to be determined; and 

(c) directions for the performance of a protocol for the detection and/or 
determination of one or more components of an immunochemical reaction between 
the weight modulator and a specific binding partner thereto. 

In accordance with the above, an assay system for screening potential drugs 
effective to mimic or antagonize the activity of the weight modulator may be 
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prepared. The weight modulator may be introduced into a test system, and the 
prospective drug may also be introduced into the resulting cell culture, and the 
culture thereafter examined to observe any changes in the activity of the cells, due 
either to the addition of the prospective drug alone, or due to the effect of added 
5 quantities of the known weight modulator. 

As stated earlier, the molecular cloning of the ob gene described herein has led to 
the identification of a class of materials that function on the molecular level to 
modulate mammalian body weight. The discovery of the modulators of the 
invention has important implications for the diagnosis and treatment of nutritional 
disorders including, but not limited to, obesity, weight loss associated With cancer 
and the treatment of diseases associated with obesity such as hypertension, heart 
disease and Type II diabetes. In addition, there are potential agricultural uses for 
the gene product in cases where one might wish to modulate the body weight of 
domestic animals. Finally, to the extent that one or more of the modulators of the 
invention are secreted molecules, they can be used biochemically to isolate their 
receptor using the technology of expression cloning. The discussion that follows 
with specific reference to the ob gene bears general applicability to the class of 
modulators that a part of the present invention, and is therefore to be accorded 
such latitude and scope of interpretation. 

Therapeutic Implications 
In the simplest analysis the ob gene determines body weight in mammals, in 
particular mice and man. The ob gene and, correspondingly, cognate molecules, 
appear to be part of a signaling pathway by which adipose tissue communicates 
25 with the brain and the other organs. It is believed that the ob polypeptide is itself 
a signaling molecule, i.e., a hormone. Alternatively ob may be responsible for 
the generation of a metabolic signal, e.g., a stimulating hormone or an enzyme 
that catalyzes activation or synthesis of a peptide or steroid hormone. The most 
important piece of information for distinguishing between these possibilities or 
30 considering alternative hypothesis, is the complete DNA sequence of the RNA and 
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its predicted protein sequence. Irrespective of its biochemical function the genetic 
data suggest that increased activity of ob would result in weight loss while 
decreased activity would be associated with weight gain. The means by which the 
activity of ob can be modified so as to lead to a therapeutic effect depends on its 
biochemical function. 

Administration of recombinant ob polypeptide can result in weight loss. 
Recombinant protein can be prepared using standard bacterial and/or mammalian 
expression vectors, all as stated in detail earlier herein. Reduction of ob 
polypeptide activity (by developing antagonists, inhibitors, or antisense molecules) 
should result in weight gain as might be desirable for the treatment of the weight 
loss associated with cancer, AIDS or anorexia nervosa. Modulation of ob activity 
can be useful for reducing body weight (by increasing its activity) or increasing 
body weight (by decreasing its activity). 

The ob polypeptide, or functionally active fragment thereof, or an antagonist 
thereof, can be administered orally or parenterally, preferably parenterally. 
Because metabolic homeostasis is a continuous process, controlled release 
administration of ob polypeptide is preferred. For example, the polypeptide may 
be administered using intravenous infusion, an implantable osmotic pump, a 
transdermal patch, liposomes, or other modes of administration. In one 
embodiment, a pump may be used (see Langer, supra : Sefton, CRC Crit. Ref. 
Biomed. Eng. 14:201 (1987); Buchwald et al., Surgery 88:507 (1980); Saudek et 
al., N. Engl. J. Med. 321:574 (1989)). In another embodiment, polymeric 
materials can be used (see Medical Applications of Controlled Release, Langer and 
Wise (eds.), CRC Pres., Boca Raton, Florida (1974); Controlled Drug 
Bioavailability, Drug Product Design and Performance, Smolen and Ball (eds.), 
Wiley, New York (1984); Ranger and Peppas, /. Macromol. Sci. Rev. MacromoL 
Chem. 23:61 (1983); see also Levy et al., Science 228:190 (1985); During et al., 
Ann. Neurol. 25:351 (1989); Howard et al., /. Neurosurg. 71:105 (1989)). In yet 
another embodiment, a controlled release system can be placed in proximity of the . 
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therapeutic target, i.e., the brain, thus requiring only a fraction of the systemic 
dose (see, e.g., Goodson, in Medical Applications of Controlled Release, supra , 
vol. 2, pp. 115-138 (1984)). Other controlled release systems are discussed in the 
review by Langer {Science 249:1527-1533 (1990)). In another embodiment, the 
therapeutic compound can be delivered in a vesicle, in particular a liposome (see 
Langer, Science 249: 1527-1533 (1990); Treat et al., in Liposomes in the Therapy 
of Infectious Disease and Cancer, Lopez-Berestein and Fidler (eds.), Liss, New 
York, pp. 353-365 (1989); Lopez-Berestein, ibid., pp. 317-327; see generally 
ibid .) 

In a further aspect, recombinant cells that have been transformed with the ob gene 
and that express high levels of the polypeptide can be transplanted in a subject in 
need of ob polypeptide. Preferably autologous cells transformed with ob are 
transplanted to avoid rejection; alternatively, technology is available to shield non- 
autologous cells that produce soluble factors within a polymer matrix that prevents 
immune recognition and rejection. 

Thus, the ob polypeptide can be delivered by intravenous, intraarterial, 
intraperitoneal, intramuscular, or subcutaneous routes of administration. 
Alternatively, the ob polypeptide, properly formulated, can be administered by 
nasal or oral administration. A constant supply of ob can be ensured by providing 
a therapeutically effective dose (i.e. , a dose effective to induce metabolic changes 
in a subject) at the necessary intervals, e.g., daily, every 12 hours, etc. These 
parameters will depend on the severity of the disease condition being treated, other 
actions, such as diet modification, that are implemented, the weight, age, and sex 
of the subject, and other criteria, which can be readily determined according to 
standard good medical practice by those of skill in the art. 

Alternatively, the ob gene could be introduced into human fat cells to develop 
gene therapy for obesity. Such therapy would be expected to decrease body 
weight. Conversely, introduction of antisense constructs into human fat cells 
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would reduce the levels of active ob polypeptide and would be predicted to 
increase body adiposity. 



If ob is an enzyme, strategies have begun to be developed for the identification of 
the substrate and product of the catalyzed reaction that would make use of the 
5 recombinant protein. The rationale for this strategy is as follows: If ob is an 
enzyme that catalyzes a particular reaction in adipose tissue, then fat cells from ob 
mice should have high levels of the substrate and very little product. Since it is 
hypothesized that db mice are resistant to the product of this reaction, fat cells 
from db mice should have high levels of the reaction product. Thus, comparisons 
10 of lipid and peptide extracts of ob and db adipose tissue using gas chromatography 
or other chromatographic methods should allow the identification of the product 
and substrate of the key chemical reaction. The prediction would be that the 
recombinant ob protein would catalyze this reaction. The product of this reaction 
would then be a candidate for a signaling molecule that modulates body weight. 

15 As noted above, the functional activity of the ob polypeptide can be effected 

transgenically,, e.g. y by gene therapy. In this respect, a transgenic mouse model 
can be used. The ob gene can be used in complementation studies employing 
transgenic mice. Transgenic vectors, including viral vectors, or cosmid clones (or 
phage clones) corresponding to the wild type locus of candidate gene, can be 

20 constructed using the isolated ob gene. Cosmids may be introduced into 

transgenic mice using published procedures (Jaenisch, Science 240, 1468-1474, 
1988). The constructs are introduced into fertilized eggs derived from an 
intercross between Fl progeny of a C57BL/6J oblob X DBA intercross. These 
crosses require the use of C57BL/6J oblob ovarian transplants to generate the Fl 

25 animals. DBA/2J mice are used as the counters train because they have a 
nonagouti coat color which is important when using the ovarian transplants. 
Genotype at the ob loci in cosmid transgenic animals can be determined by typing 
animals with tightly linked RFLPs or microsatellites which flank the mutation and 
which are polymorphic between the progenitor strains. Complementation will be 
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demonstrated when a particular construct renders a genetically obese F2 animal (as 
scored by RFLP analysis) lean and nondiabetic. Under these circumstances, final 
proof of complementation will require that the oblob or. dbldb animal carrying the 
transgene be mated to the oblob or dbldb ovarian transplants. In this cross, all N2 
animals which do not carry the transgene will be obese and insulin 
resistant/diabetic, while those that do carry the transgene will be lean and have 
normal glucose and insulin concentrations in plasma. In a genetic sense, the 
transgene acts as a suppressor mutation. 

Alternatively, ob genes can be tested by examining their phenotypic effects when 
express in antisense orientation in wild-type animals. In this approach, expression 
of the wild type allele is suppressed, which leads to a mutant phenotype. 
RNA*RNAduplex formation (antisense- sense)prevents normal handling of 
mRNA, resulting in partial or complete elimination of wild-type gene effect. This 
technique has been used to inhibit Tk synthesis in tissue culture and to produce 
phenotypes of the Kruppel mutation in Drosophila, and the shiverer mutation in 
mice (Izant and Weintraub, Cell 36, 1007-1015, 1984; Green et al., Annu. Rev. 
Biochem. 55,569-597, 1986; Katsuki et aL, Science 241, 593-595, 1988). An 
important advantage of this approach is that only a small portion of the gene need 
be expressed for effective inhibition of expression of the entire cognate mRNA. 
The antisense transgene will be placed under control of its own promoter or 
another promoter expressed in the correct cell type, and placed upstream of the 
SV40 poly A site. This transgene will be used to make transgenic mice. 
Transgenic mice will also be mated ovarian transplants to test whether ob 
heterozygotes are more sensitive to the effects of the antisense construct. 

In the long term, the elucidation of the biochemical function of the ob gene 
product (the ob polypeptide or protein) is useful for identifying small molecule 
agonists and antagonists that affect its activity. 
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Diagnostic Implications 
The human cDNA clones that have recently been isolated have been sequenced as 
presented herein. This facilitates the determination of the complete sequence of 
the human gene (see Figure 20). DNA sequences from the introns of the human 
ob gene have been obtained (Figure 20), and these have been used to prepare PCR 
primers to PCR amplify the coding sequence of the ob gene from human genomic 
DNA so as to identify mutations or allelic variants of the ob gene, all in 
accordance with protocols described in detail earlier herein. Specific PCR primers 
for amplifying human genomic ob are described in a specific Example, infra. 

The current hypothesis is that heterozygous mutations in the ob gene will be 
associated with mild/moderate obesity while homozygous mutations would be 
associated with several DNA sequence based diagnostic tests obesity. If this is 
true, it would allow the ascertainment of people at risk for the development of 
obesity and make possible the application of drug treatment and/or lifestyle 
changes before an increased body weight is fully developed. 

Alternatively, the presence of microsatellites that segregate with mutant forms of 
human ob can be used for diagnosis. Various PCR primers, including those based 
on the nucleotide sequence provided in Figure 20A, can be used in this respect. 

The ob gene may also be useful diagnostically for measurements of its encoded 
RNA and protein in nutritional disorders. It will be of importance to know, in a 
particular nutritional disorder, whether ob RNA and/or protein is unregulated or 
downregulated. Thus, if an obese person has increased levels of ob, it would 
appear that the problem is downstream of ob, while if ob is reduced, it would 
appear that inappropriately low levels of ob may be cause of obesity (whether or 
not the defect is in the ob gene). Conversely, if a cancer or AIDS patient who 
lost weight had elevated levels of ob, it may be concluded that inappropriately 
high expression of ob is responsible for the weight loss. 
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The cloned human cDNA will be of use for the measurement of the levels of 
human ob RNA. In addition, recombinant human protein will be prepared and 
used to develop immunoassays to enable measurement of the fat and perhaps 
plasma levels of the ob protein. 

Agricultural Applications 
The ob gene can also be isolated from domestic animals, and the corresponding ob 
polypeptide obtained thereby. In a specific example, infra, the a probe derived 
from the murine ob gene hybridizes to corresponding homologous coding 
sequences from a large number of species of animals. As discussed for human 
therapies, recombinant proteins can also be prepared and administered to domestic 
animals. Administration of the polypeptide can be implemented to produce leaner 
food animals, such as beef cattle, swine, poultry, sheep, etc. Preferably, an . 
autologous ob polypeptide is administered, although the invention contemplates 
administration of anti-autologous polypeptide as well. Since the ob polypeptide 
consists of approximately 160 amino acid residues, it may not be highly 
immunogenic. Thus, administration of non-autologous polypeptide may not result 
in an immune response. 

Alternatively, the introduction of the cloned genes into transgenic domestic 
animals would allow one to potentially decrease body weight and adiposity by 
overexpressing an ob transgene. The simplest means of achieving this would be to 
target an ob transgene to fat using its own or another fat specific promoter. 

Conversely, increases in body fat might be desirable in other circumstances such 
as for the development of Kobe beef or fatty liver to make foie gras. This could 
be accomplished by targeting an antisense ob transgene to fat, or by using gene 
knockout technology. Alternatively, where an increase in body weight at 
percentage of fat is desired, an inhibitor or antagonist of the ob polypeptide can be 
administered. Such inhibitors or antagonists include, but are not limited to, 
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antibodies reactive with the polypeptide, and fragments of the polypeptide that 
bind but do not activate the ob receptor, i.e., antagonists of ob polypeptide. 

The ob Receptor 

Development of small molecule agonists and antagonists of the ob factor will be 
greatly facilitated by the isolation of its receptor. This can be accomplished by 
preparing active ob polypeptide and using it to screen an expression library using 
standard methodology. Receptor binding in the expression library can be tested by 
administering recombinant polypeptide prepared using either bacterial or 
mammalian expression vectors, and observing the effects of short term and 
continuous administration of the recombinant polypeptide on the cells of the 
expression library, or by directly detecting binding of ob polypeptide to the cells. 

As it is presently believed that the ob receptor is likely to be located in the 
hypothalamus and perhaps liver, preferably cDNA libraries from these tissues will 
be constructed in standard expression cloning vectors. These cDNA clones would 
next be introduced into COS cells as pools and the resulting transformants would 
be screened with active ligand to identify COS cells expressing the ob receptor. 
Positive clones can then be isolated so as to recover the cloned receptor. The 
cloned receptor would be used in conjunction with the ob ligand (assuming it is a 
hormone) to develop the necessary components for screening of small molecule 
modulators of ob. 

Once a recombinant which expresses the ob receptor gene sequence is identified, 
the recombinant ob receptor can be analyzed. This is achieved by assays based on 
the physical or functional properties of the ob receptor, including radioactive 
labelling of the receptor followed by analysis by gel electrophoresis, immunoassay, 
ligand binding, etc. Furthermore, antibodies to the ob receptor could be generated 
as described above. 
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The structure of the ob receptor can be analyzed by various methods known in the 
art. Preferably, the structure of the various domains, particularly the ob binding 
site, is analyzed. Structural analysis can be performed by identifying sequence 
similarity with other known proteins, particular hormone and protein receptors. 
5 The degree of similarity (or homology) can provide a basis for predicting structure 
and function of the ob receptor, or a domain thereof. In a specific embodiment, 
sequence comparisons can be performed with sequences found in GenBank, using, 
for example, the FASTA and FASTP programs (Pearson and Lipman, 1988, Proc. 
Natl. Acad. Sci. USA 85:2444-48). 

10 The protein sequence can be further characterized by a hydrophilicity analysis 
{e.g., Hopp and Woods, 1981, Proc. Natl. Acad. Sci. U.S.A. 78:3824). A 
hydrophilicity profile can be used to identify the hydrophobic and hydrophilic 
regions of the ob receptor protein, which may in turn indicate extracytoplasmic, 
membrane binding, and intracytoplasmic regions. 

15 Secondary structural analysis {e.g., Chou and Fasman, 1974, Biochemistry 
13:222) can also be done, to identify regions of the ob receptor that assume 
specific secondary structures. 

Manipulation, translation, and secondary structure prediction, as well as open 
reading frame prediction and plotting, can also be accomplished using computer 
20 software programs available in the art. 

By providing an abundant source of recombinant ob polypeptide, and the 
opportunity to isolate the ob receptor {i.e., the db gene product), the present 
invention enables quantitative structural determination of the active conformation 
of the ob polypeptide and the ob receptor, or domains thereof. In particular, 
25 enough material is provided for nuclear magnetic resonance (NMR), infrared (IR), 
Raman, and ultraviolet (UV), especially circular dichroism (CD), spectroscopic 
analysis. In particular NMR provides very powerful structural analysis of 
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molecules in solution, which more closely approximates their native environment 
(Marion et al., 1983, Biochem. Biophys. Res. Comm. 113:967-974; Bar et al., 
1985, J. Magn. Reson. 65:355-360; Kimura et al., 1980, Proc. Natl. Acad. Sci. 
U.S.A. 77:1681-1685). Other methods of structural analysis can also be 
employed. These include but are not limited to X-ray crystallography (Engstpm, 
A., 1974, Biochem. Exp. Biol. 11:7-13). 

More preferably, co-crystals of ob polypeptide and ob receptor can be studied. 
Analysis of co-crystals provides detailed information about binding, which in turn 
allows for rational design of ligand agonists and antagonists. Computer modeling 
can also be used, especially in connection with NMR or X-ray methods 
(Fletterick, R. and Zoller, M. (eds.), 1986, Computer Graphics and Molecular 
Modeling, in Current Communications in Molecular Biology, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York). 

Identification and isolation of a gene encoding an ob receptor of the invention 
provides for expression of the receptor in quantities greater than can be isolated 
from natural sources, or in indicator cells that are specially engineered to indicate 
the activity of a receptor expressed after trans fection or transformation of the cells. 
According, in addition to rational design of agonists and antagonists based on the 
structure of ob polypeptide, the present invention contemplates an alternative 
method for identifying specific ligands of ob receptor using various screening 
assays known in the art. 

Any screening technique known in the art can be used to screen for ob receptor 
agonists or antagonists. The present invention contemplates screens for small 
molecule ligands or ligand analogs and mimics, as well as screens for natural 
ligands that bind to and agonize or antagonize activates ob receptor in vivo. 

Knowledge of the primary sequence of the receptor, and the similarity of that 
sequence with proteins of known function, can provide an initial clue as the 
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inhibitors or antagonists of the protein. Identification and screening of antagonists 
is further facilitated by determining structural features of the protein, e.g., using 
X-ray crystallography, neutron diffraction, nuclear magnetic resonance 
spectrometry, and other techniques for structure determination. These techniques 
provide for the rational design or identification of agonists and antagonists. 

Another approach uses recombinant bacteriophage to produce large libraries. 
Using the "phage method" (Scott and Smith, 1990, Science 249:386-390; Cwirla, 
et al., 1990, Proc. Natl. Acad. Sci., 87:6378-6382; Devlin et al., 1990, Science, 
249:404-406), very large libraries can be constructed (lOMO 8 chemical entities). 
A second approach uses primarily chemical methods, of which the Geysen method 
(Geysen et al., 1986, Molecular Immunology 23:709-715; Geysen et al. 1987, J. 
Immunologic Method 102:259-274) and the recent method of Fodor et al. (1991, 
Science 251, 767-773) are examples. Furka et al. (1988, 14th International 
Congress of Biochemistry, Volume 5, Abstract FR:013; Furka, 1991, Int. J. 
Peptide Protein Res. 37:487-493), Houghton (U.S. Patent No. 4,631,211, issued 
December 1986) and Rutter et al. (U.S. Patent No. 5,010,175, issued April 23, 
1991) describe methods to produce a mixture of peptides that can be tested ais 
agonists or antagonists. 

In another aspect, synthetic libraries (Needels et al., 1993, "Generation and 
screening of an oligonucleotide encoded synthetic peptide library," Proc. Natl. 
Acad. Sci. USA 90:10700-4; Lam et al., International Patent Publication No. WO 
92/00252, each of which is incorporated herein by reference in its entirety), and 
the like can be used to screen for ob receptor ligands according to the present 
invention. With such libraries, receptor antagonists can be detected using cell that 
express the receptor without actually cloning the ob receptor (Lam et al., supra). 

Alternatively, assays for binding of soluble ligand to cells that express 
recombinant forms of the ob receptor ligand binding domain can be performed. 
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The soluble ligands can be provided readily as recombinant or synthetic ob 
polypeptide. 
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The screening can be performed with recombinant cells that express the ob 
receptor, or alternatively, using purified receptor protein, e.g., produced 
5 recombinantly, as described above. For example, the ability of labeled, soluble or 
solubilized ob receptor that includes the ligand-binding portion of the molecule, to 
bind ligand can be used to screen libraries, as described in the foregoing 
references. ' 

EXAMPLE SECTION 

10 The following outlines the method used to identify the genetic material that is 
exemplary of the present invention. This endeavor comprises four sequential 
steps: A) Genetic Mapping, B) Physical Mapping, C) Candidate Gene Isolation, 
and D) Mutation detection. Following confirmation that the murine gene in object 
was isolated (Step D), the homologous human gene was sought, and both the 

15 murine and human genes and putative proteins were characterized. The steps are 
summarized in greater detail, below. 

A. Genetic Mapping 

The ob mutation was segregated in genetic crosses and standard linkage analysis 
was used to position the mutation relative to RFLPs (restriction fragment length 

20 polymorphisms). These data placed the ob gene in an — 5cM interval on 
proximal mouse chromosome 6. (5cM is a measurement of genetic distance 
corresponding to 5 apparent genetic crossovers per 100 animals.) A total of 771 
informative meioses were generated and used in subsequent genetic mapping 
(Friedman et al. Genomics 11: 1054-1062, 1991). The genetic loci that were 

25 mapped relative to ob were all previously published. The two closest RFLPs 
described were defined by probes derived from the carboxypeptidase and met 
oncogene genes. 
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The genetic resolution of the experiments described above was inadequate to clone 
ob, principally because none of the genetic markers were in tight linkage. In 
order to identify the requisite tightly linked RFLPs, additional probes were 
isolated and the genetic cross was expanded. A method known, as chromosome 
5 microdissection was used to isolate random pieces of DNA from proximal mouse 
chromosome 6 (Bahary et al., Mammalian Genome 4: 511-515, 1993). Individual 
cloned probes were tested for tight linkage to ob. On the basis of these studies 
one probe, D6Rckl3, also termed psd3, was selected for further analysis owing to 
its genetic proximity to ob. 

This probe was used to genotype 835 ob progeny from interspecific and 
intersubspecific crosses, which indicated that D6Rckl3 is nonrecombinant in all 
835 animals as reported in Bahary et al. In the course of physical mapping, a new 
polymorphic marker was identified from a cosmid subclone derived from YAC 
53A6. This new marker was positioned between D6Rckl3 and the ob gene and 
was used to genotype the additional 771 informative meioses from intraspecific 
intercross and backcross. A single animal #167 was identified to bear a 
recombination crossover between ob and D6Rck39. These studies indicated that 
D6Rck39/D6RcK13 is -0.06 cM from ob. An additional probe, Pax-4, was 
identified that was . 12 cM proximal to ob. Pax-4 was recombinant in two 
animals; #111 and 420. Pax-4 is a pseudogene that was previously mapped to 
proximal mouse chromosome 6 by Gruss and co-workers (Gruss et al. Genomics 
11:424-434, 1991). On this basis, it was determined that the ob gene resides in 
the — 0.2cM interval between Pax-4 and D6Rckl3. This led to efforts to clone 
the interposing DNA in an effort to isolate ob. 

25 IL Physical Mapping 

The cloning of the DNA in this interval made use of yeast artificial chromosomes 
(YACs), a relatively new cloning vector that allows the cloning of long stretches 
of contiguous DNA often more than 1 million base pairs in length. 
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Firstly, yeast artificial chromosomes were isolated using D6Rckl3 and Pax-4. 
This was accomplished by preparing purified DNA probes and using them to 
isolate the corresponding YACs. These YACs (#8, 16, 107 and 24) were isolated 
and initially characterized, and on the basis of the resulting analyses it was 
concluded that YAC 16 was the YAC that extended furthest distally, i.e., closest 
to ob. The key end of YAC #16 was then recovered, and it was determined that 
this end was closer to ob than Pax-4. This end was termed 16M(+). This 
conclusion was reached because it was shown that this probe was not recombinant 
in animal #420 (as was Pax-4). This end was sequenced and used to develop a 
PCR assay. This PCR assay was used to screen a YAC library. Four positive 
clones were isolated. Subsequent characterization of these YACs by end-rescuing, 
restriction mapping, pulse field gel electrophoresis, and Southern blots with the 
genetic crosses determined that two of these YACs, adu and aad, were critical for 
subsequent studies. YAC aad is a 550 kB nonchimeric YAC which extended 
furthest distally. Therefore, the distal end of this YAC, aad(pICL) was used to 
complete the physical map. YAC adu is 370 kB nonchimeric YAC and its distal 
end, adu(+), was determined to be nonrecombinant in all the ob progeny of the 
genetic crosses including animals #111 and 167, suggesting that the ob gene might 
reside in this YAC. 

A PCR assay for these two ends, aad(plCL) and adu(+) was developed and used 
for isolating moire YACs and PI clones to continue physical mapping. The 
important Pi clones isolated by this effort included 498, 499, 500 (isolated using a 
probe derived from aad(plCL)) and 322, 323 and 324 (using a probe from 
adu{+)). 

In the meantime, YACs isolated by D6Rckl3 (53A6, 25A8, 25A9, 25A10) were 
characterized. These studies determined that 53A6 extended furthest proximally 
toward the aad YAC. The size of the gap between 53 A6 and aad was determined 
— 70 kB. The key end of 53 A6, 53(pICL) was then used to screen three available 
YAC libraries and a PI library. A critical PI clone, 325, was isolated. This PI 
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clone overlapped with the PI clones isolated by aad(plCL) as described above, 
and therefore served to close the gap between 53(pICL) and aad(pICL). As a 
result, the whole contig, containing YACs and PI clones, of — 2.5 million base 
pairs in length, and which spanned Pax4, 16M(+), adu(+) y aad(p\CL), 53(pICL), 
D6Rck39 and D6Rckl3, was cloned. By carefully mapping the sites of 
recombination apparent in animal #111 and 167, it was concluded that ob was 
situated in a 400 kB interval. To provide a working DNA source for isolating the 
ob gene, about 500 kB covering this nonrecombination region was isolated in a 
total of 24 PI clones. These PI clones, including 322 and 323, which later were 
proved to be useful clones, were used for exon trapping. 

The physical map of the portion of the chromosome carrying ob is shown in 
Figure 7A. Figure 7B represents the YAC contig. Figure 7C represents the PI 
contig. 

C. Isolation of Candidate Genes 

The method used to isolate genes in this interval was exon trapping. This method 
used a commercial vector to identify exon DNA (i.e., coding sequences) by 
selecting for functional splice acceptor and donor sequences in genomic DNA 
introduced into a test construct. The DNA from these Pis were grown and 
subcloned into the exon trapping vector. These clones were short inserts cloned 
into a Bluescript vector. Each clone was PCR amplified with PCR primers 
corresponding to plasmid sequences that flanked the insert. The PCR 
amplification was performed directly on the bacteria that carried the plasmid. The 
reactions were set up using a Biomek robot. The PCR products were 
electrophoresed on a 1 % agarose gel in TBE buffer that contained ethidium 
bromide. The exon trapping technique was modified to eliminate contaminating E. 
coli DNA from the PI clones, and to screen out the abundant artifactual exons, 
which exceeded 80-90% of the putative exons trapped. The exon trapping vector 
includes HrV sequences; a short segment of these vector sequences corresponds to 
this artifact. 
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The exon trapping experiment was performed using various PI clones. Exon 
trapping products were then amplified by PCR, selected, and sequenced. 
Sequences of putative "exons" were compared with those in Genbank using the 
Blast computer program. About 15 exons were selected for further examination 
5 by RT-PCR, Northern analysis, and zoo blot for the presence of corresponding 
RNA or conservative sequences. Seven of the 15 putative exons, 325-2, 323-9, 
322-5, D1-F7, 1H3, and 2G7, were found to encode an RNA transcript. 325-2 is 
a testis specific gene; 323-8 and 323-9 are likely two exons from the same gene 
expressed mainly in brain and kidney. IH3 and 322-5 represent two low level 
10 brain transcripts. D1-F7 is an exon from a previously cloned gene, inosine 

monophosphate dehydrogenase (IMPDH), which has ubiquitous expression pattern. 
J None of these genes appeared to encode ob. 2G7, which is the ob exon, is 

rJ discussed further below. 

03 

m - 

01 

After three unsuccessful efforts to exon trap the ob gene, another attempt was 
" 4 15 made by pooling DNA from all the Pis from the critical ob region. These 

H» included Pis: 258, 259, 322, 323, 324, 325, 498, 499, 500, 653, 654 and others, 

p; Thereafter Pis 258, 260, 322, 498 and 499 were subcloned into the exon trapping 

y vector, and subsequently several plates were prepared with bacterial clones, each 

O of which carried a putative exon. Approximately 192 clones representing putative 

20 ob candidates were obtained. As noted above, a consistent artifact such that many 
of the isolates contained two trapped exons derived from the vector was observed. 
Thus, clones were identified both by their size and the fact that hybridization of 
DNA probes corresponding to this artifact hybridized to the corresponding bands 
on a Southern blot of the gel. In this way, 185 out of 192 clones were excluded 
25 from further evaluation. Exclusion of the artifacts on the basis of size alone was 
not possible, as this could have, in the end, led to exclusion of the exon 
corresponding to ob. 



Thus, of the 192 exons, a total of seven exons were selected for further study. 
Templates for sequencing the seven exons were prepared, and sequencing was 
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performed. The sequences for the 7 exons were analyzed and it was found that 4 
were identical and one was an apparent artifact. In particular, clone ID 12 
contained the "HIV sequence," i.e., the artifact band. This left three exons for 
further analysis: 1F1, 2G7 and 1H3. 1F1 was eliminated because it mapped 
5 outside the critical region. PCR primers for both 1H3 and 2G7 were selected and 
synthesized. 

The sequence of the exon on 2G7 was determined, and is shown in Figure 10 
(SEQ ID NO:7). PCR primers for 2G7 were selected and synthesized. The 
portions of the sequence corresponding to the PCR primers are underlined. The 
10 primers used were: 

5' CCA GGG CAG GAA AAT GTG (Tm = 60.0) (SEQ ID NO:8) 

3' CAT CCT GGA CTT TCT GGA TAG G (Tm = 60.0) (SEQ ID NO:9) 

These primers amplified genome DNA with PCR conditions as follows: 25-30 
cycles at 55° annealing x 2', 72° extension x 2', 94° denaturation x V in standard 
15 PCR buffer. These primers were also used to generate a labeled probe by 
including 32 P dCTP in the PCR reaction with a corresponding reduction in the 
amount of cold dCTP. 

A RT PCR was performed on a variety of tissue RNAs and it was concluded that 
2G7 was expressed exclusively in white fat among the tissues examined (Figure 

20 11 A). Thereafter, 32 P-labelled 2G7 was hybridized to a Northern blot of tissue 
RNAs (Figure 11B) and showed that its RNA was expressed at high level in fat 
tissue but was either not expressed or eixpressed at very low levels in all other 
tissues (where the signals may be the result of fat contaminating the tissue 
preparations). Ten /ig of total RNA from each of the tissues listed was 

25 electrophoresed on an agarose gel with formaldehyde. The probe was hybridized 
at 65° in a standard hybridization buffer, Rapid Hype (Amersham). The size of 
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the RNA was approximately 4.9 kB. At this point 2G7 was considered to be a 
viable candidate gene for ob and was analyzed further. 

D. Mutation Detection 

In order to confirm that 2G7 encoded the ob gene, it was necessary to demonstrate 
5 differences in the levels of RNA expression of DNA sequence of this gene in 
mutant as compared to wild type animals. Two separate mutations of the ob gene 
are available for study, C57BL/6J ob/ob (1J) and Ckc/Smj ob/db (2J). These will 
be referred hereinafter as 1J and 2J, respectively. (Informal nomenclature is used 
to refer to the mouse strains studied. Throughout this specification and in the 
10 drawings, it will be understood that C57BL/6J refers to C57BL/6J +/+; CKC/smj 
O refers to SM/Ckc-+ DflC - + / + ; CKC/smj ob/ob refers to SM/Ck^+^-ob^/ob 2 *). 

Si RNA was prepared from fat tissue that had been isolated from 1J, 2 J, and control 

02 

m animals. Total RNA for each sample was treated with DNase and then reverse 

Oj transcribed using oligo-dT as a primer and reverse transcriptase. The resulting 

SJ 15 single stranded cDNA was then PCR amplified either with the 2G7 primers 

(conditions shown above) for the lower band or commercially available actin 
H primers for the upper band. The RT PCR products were run on a 1 % agarose 

O TBE gel that was stained with ethidium bromide (Figure 12A). Using RT-PCT it 

5J was found that while 2G7 mRNA was expressed in 1J and all the other control 

20 mice, it was completely missing in 2J mouse. No signal was detected after 30 

cycles of amplification. This experiment provided direct evidence that 2G7 

corresponded to an exon from the ob gene. 

Since 2 J mutation is relatively recent and is maintained as a coisogenic strain, this 
result was the first available evidence that indicated that 2G7 is an exon from the 
25 ob gene. The mutation is likely located in the promoter region which leads to 

total abortion of the mRNA synthesis. The presence of signal in 1 J mouse in this 
RT-PCT experiment suggested that 1J might carry a point mutation which does not 
result in a gross change in size of the RNA sample. In addition, 2G7 mRNA was 
absent, when tested by RT PCR, from four additional 2 J animals. 



# 



68 

This result was confirmed on a Northern blot (Figure 12B). Fat cell RNA was 
prepared from each of the strains (C57B1/6J, 1J, CKC/smj, and 2J). Ten fxg of 
these RNAs were run out. The blot was probed with the 2G7 probe that was PCR 
labeled, by amplification of the material, i.e., band, in Figure 11 using 32 P-dCTP 
5 in the PCR reaction. Actin is a control for the amount of RNA loaded. The actin 
signal is fairly similar in all of the samples. The ob signal is absent in brain 
because the mRNA is specific to fat cells. 

The results of the Northern analysis confirm that 2G7 RNA was absent in 2J mice. 
The ob RNA is absent in the CKC/smj ob/ob mice because in this obese mutant 
10 strain the gene is disrupted such that no RNA is made. In addition, the level of 
Q 2G7 RNA was increased - 10-20 fold in 1J as well as db/db fat. These results 

CP are compatible with the hypothesis that ob either encodes circulating hormone or is 

p responsible for the generation of a signal from fat cells that modulates body 

Sj weight. These results supported the conclusion that 2G7 is the ob gene and 

15 predicted that 1J mice have a point mutation, probably a nonsense mutation 
leading to a premature translation termination. 

These Northern results have been replicated using fat cell RNA preparations from 
four different 2 J animals (Figure 13). In this assay, ap2 is a fat-specific transcript 
that was used as a control much the same as actin in Figure 12B. There is no 
20 significance to the varying density of the ap2 band. ap2 was labeled by designing 
PCR primers form the published ap2 sequence. The RT PCR products of fat cell 
RNA were then relabeled using the same protocol for PCR labeling. This analysis 
demonstrates the presence of ob mRNA in normal homozygous or heterozygous 
animals, and its absence from 2 J mutant animals. 



25 



The mutation has been identified in 1J mice. The mutation is a C to T base 
change that results in change of an arginine to an apparent premature stop codon at 
amino acid 108, and in all likelihood accounts for the 1J mutation (Figure 14) 
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despite high level expression of the ob mRNA (see Figure 12 and 13, C57BL/6J 
ob/ob lanes). 

More recently, Southern blots have been used to conclude that the 2J mutation is 
the result of a detectable DNA change at the 5' end of ob that appears to 
completely abolish RNA expression. The exact nature of this possible 
rearrangement remains to be determined. 

A genomic Southern blot of DNA from the CKC/smj (SM/Ckc--^) and 
C57BL/6J mice using four different restriction endonucleases was performed in 
order to determine whether the mutant ob yielded a unique fragment pattern 
(Figure 15 A). Approximately 10 /xg of DNA (derived from genomic DNA 
prepared from liver, kidney, or spleen) was restriction digested with the restriction 
enzyme indicated. The DNA was then electrophoresed in a 1 % agarose TBE gel. 
The DNA was transferred to an imobilon membrane and hybridized to the VCR 
labeled 2G7 probe. The key band is the uppermost band in the BglH digest for the 
CKC/smj ob/ob (SM/Ckc-+ DAC ob^/ob 2 *) DNA. This band is of higher molecular 
weight than in the other strain, indicating a mutation in this strain. 

Figure 15B is a southern blot of a BglH digest of genomic DNA from the progeny 
of an ob 2 */-*- x ob 2J /+ cross. Some of the DNAs have only the upper band, some 
only the lower band, and some have the both bands. The animals with only the 
upper band are allo-obese, i.e., ob^/ob 21 . These data show that the polymorphism 
{i.e., mutation) shown in Figure 15 A segregates in a genetic sense. 

cDNa Cloning and Sequence Determination of ob 
Using the labeled 2G7 PCR probe, a total of 50 mouse cDNA clones from a 
murine fat cell Xgtll cDNA library (Clonetech 5'-STRETCH cDNA from 
testicular fat pads of Swiss mice, #ML3005b), and thirty cross hybridizing human 
cDNA clones from a human fat cell XgtlO cDNA library (Clonetech 5 '-STRETCH 
cDNA from abdomen #HL1 108a) were isolated. Library screening was performed 
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using the plaque lift procedure. The filters from the plaque lift were denatured 
using the autoclave method. The filters were hybridized in duplicate with the PCR 
labeled 2G7 probe (Rapid Hybe buffer, 65 °C, overnight). After a 2-4 hour 
prehybridization, the filters were washed in 2x SSC, 2% SDS, twice for 30 
minutes at 65°C and exposed to SRy Llim. Duplicate positives were plaque 
purified. Plaque purified phage were PCR amplified using commercially available 
vector primers, e.g., XgtlO and Xgtll. The resulting PCR products corresponded 
to the cDNA insert for each phage with a small amount of vector sequence at 
either end. The bands were gel purified and sequenced using the ABI automated 
sequencer and the vector primers to probe the DNA polymerase. 

The raw sequencing data were then manually examined base by base to correct 
mishearing from the computer program. As the correct sequence became 
available, the downstream primers were synthesized and used to continue 
sequencing. Such experiments were repeated until each available cDNA clone was 
sequenced and synthesized into a contig; To date, —3000 base pairs from the 5' 
end of the mRNA has been compiled. One of the cDNA clones extended to the 5' 
end of the mRNA since its sequence was identical to that of the 5' RACE product 
of fat tissue RNA (data not shown). 

The sequence data revealed that there is a 167 amino acid open reading frame 
(Figure 1). A Kozak translation initiation consensus sequence was present with an 
adenosine residue three bases upstream of the ATG. Two classes of cDNA were 
found differing by inclusion or exclusion of a single glutamine codon. This 
residue is found in a position immediately 3' to the splice acceptor of the 2G7 
exon. Since the CAG codon of glutamine includes a possible AG splice acceptor 
sequence, it appears that there is slippage at the splice acceptor site with an 
apparent 3 base pairs deletion in a subset of the cDNA, as shown below. 
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gin ser val 

ag CAG TCG GTA (with glutamine) (SEQ ID NO: 16) 

t 

(splice acceptor site) 

5 ser val 

ag CAG TCG GTA (without glutamine) (SEQ ID NO: 17) 

t 

(splice acceptor site) 

The "ag" in the sequences above corresponds to the assumed intron sequence 
10 upstream of the glutamine codon, and AG is the putative alternative splice site. 
This glutamine residue is located in a highly conserved region of the molecule and 
p~ its importance for biological activity is as yet unknown. 

01 

03 A putative N-terminal signal sequence was detected, the signal cleavage site of 

which is predicted to be carboxy terminal to the alanine residue at amino acid 
15 position 21. This putative signal sequence was confirmed by application of a 
computer algorithm to the method of von Heijne (Nucl. Acids Res. 14, 4683, 
1986). Using this technique, the most probable signal sequence was identified in 
the polypeptide coding region corresponding to amino acids 1-23, having the 
sequence: 

20 MCWRPLCRFLWLWSYLSYVQA t VP (SEQ ID NO: 10) 

in which the arrow indicates the putative signal sequence cleavage site. The rest 
of the amino acid sequence was largely hydrophilic and did not have any notable 
structural motifs or membrane spanning domains other than the N-terminal signal 
sequence. Specifically, we did not find consensus sequences for N-linked 
25 glycosylation or dibase amino acid sequences indicative of protein cleavage in the 
predicted processed protein (Sabatini and Adesnik, The metabolic basis of inherited 
disease, C.V. Scriver et al. eds., McGraw-Hill: New York, pp. 177-223). Data 
base search using Blast and Block programs did not identify any homologous 
sequence. 
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Human fat tissue RNA was analyzed on Northern blot, RN/C species of similar 
size to the mouse ob gene was detected. Sequencing apd analysis of cDNA clones 
revealed that human ob also encodes 167 amino acia polypeptide (Figures 2 and 
3). Two classes of cDNA with or without thpe base pairs deletion were found in 
5 human as well (Figure 6). The mouse and^human ob genes were highly 

homologous in the predicted coding region, but had only 30% homology in the 
available 3' and 5' untranslated^gions. An N- terminal signal sequence was also 
present in the human ob polypeptide. Comparison of the human and mouse ob 
polypeptide sequences sjrowed that the two molecules share an overall 84% 
10 identity at amino add level (Figure 4). The N-termini of the mature proteins from 
both species shape even higher homology, with only four conservative and three 
nonconserv^jtfve amino acid substitutions among the N- terminal 100 amino acid 
residues, 

Genomic DNA was isolated from mouse, rat, rabbit, vole, cat, cow, sheep, pig, 
15 human, chicken, eel, and drosophila, and restriction digested with EcoRl. The 
digests were electrophoresed on 1 % agarose TBE gel. DNA was transferred to an 
imobilon membrane and probed with the PCR labeled 2G7 probe. The filter was 
hybridized at 65°C and washed with 2x SSC, 0.2% SDS at 65°C twice for twenty 
minutes each wash, i.e., there were two buffer changes. These data indicate that 
20 ob is conserved among vertebrates (Figure 16). Note in this regard that there is a 
2+ signal in eel DNA; eel is a fish. 

In summary, available evidence suggests that body weight and adiposity are 
physiologically controlled. Seven years ago efforts began to identify two of the 
key components of this system: the ob and db genes. As shown in this example, 
25 the ob gene has now been identified as a fat specific gene that plays a key role in 
regulating body weight. The product of this gene, which is most probably a 
secreted hormone, will have important implications for the diagnosis and treatment 
of nutritional disorders in man and non-human animals. 
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EXAMPLE: Expression of ob In Bacteria 



Both murine and human cDNAs encoding ob have been cloned into a pET-15b 
expression vector (Novagen). This vector contains a T7 promoter in conjunction 
with a lac operator, and expresses a fusion protein containing a histidine tag (His- 
Tag) and a thrombin cleavage site immediately upstream of the coding sequence 
insertion site (Figure 17) (SEQ ID No: 11). 

The mouse and human cDNAs were modified such that the alanine at the end of 
the signal sequence was turned into an Ndel site, as was a separate sequence in the 
3' region. Insertion of the Ndel site was accomplished using PCR with novel 
primers: 

Mnde-5' (murine five prime primer) : 

CTTATGTTCA TATGGTGCCG ATCCAGAAAG TC (SEQ ID NO: 12) 
Mnde-3' (murine three prime primer) : 

TCCCTCTACA TATGTCTTGG GAGCCTGGTG GC (SEQ ID NO: 13) 
Hnde-5' (human five prime primer) : 

TCTATGTCCA TATGGTGCCG ATCCAAAAAG TC (SEQ ID NO: 14) 
Hnde-3' (human three prime primer) : 

TTCCTTCCCA TATGGTACTC CTTGCAGGAA GA (SEQ ID NO: 15) 

The primers contain a 6-base pair mismatch in the middle that introduces Ndel 
restriction sites at each end of the PCR fragment. Phage carrying either the 
mouse or human cDNA were PCR amplified using those primers. The PCR 
product was digested with Ndel and gel purified on a 1 % low melting point 
agarose gel. The gel purified bands were subcloned into the pET vector. The 
resulting plasmids were sequenced to ensure that mutations were not introduced 
during the PCR amplification step of cloning. Constructs for the human and 
murine cDNA that encodes and that lacks glutamine 49 have been prepared. In 
particular, pET 15b constructs containing either the human or the mouse ob coding 
sequence, minus signal sequence and fused to a Hig-Tag, have been made using a 
PCR cloning method. The constructs have been sequenced to ensure no sequence 
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errors were introduced into the coding region of the ob gene during the PCR 
amplification step. 

Two resultant plasmid constructs, pETM9 and pETH14, were selected to 
transform a bacterial expression host. Upon induction with 1 raM IPTG under 
optimal conditions, the transformed bacteria were able to produce 100-300 fig/ml 
of the ob fusion. The majority of the ob fusion protein was found in the inclusion 
body. After solubilization with 6M guanidine-HCl or urea, the fusion protein was 
purified through a His-binding (Ni-chelation) resin column. The conditions for 
column purification of the ob fusion protein (including binding, washing, and 
eluting) were established experimentally. The ob fusion protein binds to the resin 
at 5 mM imidazol/6M guanidine-HCl and stays bound at up to 20 mM 
imidazol/6M guanidine-HCl. The protein can be eluted form the resin at 60 mM 
imidazol/6M guanidine (Figure 18A,B). Both the purified human and mouse ob 
fusion proteins were further dialyzed in PBS to remove guanidine-HCl from the 
preparation and used to raise polyclonal antibodies. 

In order to test the biological activity of the fusion protein products, the refolding 
conditions for the purified protein was tested and developed. This involves initial 
dialysis of the fusion protein in 1 M guanidine solution, followed by dilution with 
0.4 M arginine solution. The His-Tag was removed from the fusion proteins 
before biological function assay. The tag removal was achieved by treating the 
fusion protein with thrombin from human placenta. 

In addition, human and mouse ob gene coding sequence minus the signal sequence 
is being inserted into a pET 12c vector using PCR cloning method. These 
constructs can direct the synthesized ob fusion proteins into the periplasmic space 
of the bacterial host cell. The ob fusion protein recovered from the periplasmic 
space may only need a simple gel filtration to be purified from other host proteins 
and will not be denatured during such process. 
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EXAMPLE: Preparation of Antibodies to the ob Polypeptide 

In addition to use of the recombinant protein to generate polyclonal antibodies, a 
set of four peptide sequences from the deduced murine ob sequence were identified 
using immunogenicity plot software (GCG Package). The four carboxyl terminal 
5 peptide fragments are: 



(SEP ID NO: 18) : 

Val-Pro-Ile-Gln-Lys-Val-Gln-Asp-Asp-Thr-Lys-Thr-Leu-Ile-Lys-Thr 
(SEP ID NO: 19) : 

Leu-His-Pro-Ile-Leu-Ser-Leu-Ser-Lys-Met-Asp-Gln-Thr- Leu-Ala 

10 (SEP ID NP:20) : 

Ser-Lys-Ser-Cys-Ser-Leu-Pro-Gln-Thr-Ser-Gly-Leu-Gln-Lys-Pro-Glu-Ser-Leu-Asp 

(SEP ID NP:21) : 

Ser-Arg-Leu-Gln-Gly-Ser-Leu-Gln-Asp-Ile-Leu-Gln-Gln-Leu-Asp-Val-Ser-Pro-Glu- 
Cys 



15 These peptides were conjugated to KLH, and the peptide-KLH conjugates were 
used to immunize rabbits using standard techniques. Polyclonal antisera specific 
for each peptide is recovered from the rabbits. 

EXAMPLE: In Vitro Translocation of an ob Polypeptide 



In order to confirm the presence of a functional signal sequence, a human cDNA 
20 that included the entire open reading frame was subcloned into the pGEM vector. 
Pnly the human cDNA was used in this experiment because suitable mouse 
subclones were not recovered. Positive strand human ob RNA was transcribed 
using sp6 polymerase and used in an in vitro translation reaction with and without 
canine pancreatic microsomal membranes. The primary translation product 
25 migrated with an apparent molecular weight of — 18 kD, which is consistent with 
that predicted by the cDNA sequence. Inclusion of the microsomal membranes in 
the reaction inhibited the overall efficiency of translation —5 fold. Nevertheless, 
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approximately 50-70% of the ob primary translation product was truncated by 
approximately 2 kD in the presence of the membrane preparation, suggesting that 
the signal sequence is functional (Figure 19A). The size of the primary translation 
product of interleukin-la RNA, which does not encode a signal sequence, was 
5 unchanged when microsomal membranes were included in the reaction. In order 
to confirm that translocation of the ob protein had taken place, the in vitro 
translation products were treated with Proteinase-K. Protease treatment resulted in 
the complete proteolysis of the 18 kD primary translation product while the 16 kD 
processed form was unaffected by the enzyme treatment, indicating that it had 
10 translocated into the lumen of the microsomes (Figure 19B). These data are 
compatible with the hypothesis that ob is a secreted molecule. 

After signal sequence cleavage, two cysteine residues would remain within the 
predicted protein raising the possibility that the molecule contains a disulfide bond 
characteristic of other secreted polypeptides (Shen and Rutter, 1984, Science 
15 224:168-171). 

EXAMPLE: Characterization of the ob Gene 

To establish the relationship between oBesity and genetic alterations in the ob gene 
in humans, the sequence of the humeri ob gene was determined (Figure 20 A) (SEQ 
ID NO: ). Specific primers fronr Jie human coding sequence were used to screen 
20 a human PI library. Three different PI clones were obtained, grown up, and PGR 
amplified using primers flzmking the splicing site between the first and second 
coding exon. The entjre intron region, around 2 kB, was amplified and partially 
sequenced (see Figure 20A; and as indicated in SEQ ID NO:22). 

The gene structure of both the murine and human genes was characterized using 
25 PCR assays and other standard techniques. The mouse ob gene was found to 

consist of 3 exons, the second and third of which account for the coding sequence 
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(Figure 20B). The coding region of the human ob gene shares the same structure; 
however, the human gene lacks a 5' exon and intron (Figure 20C). 



01 



Two sets of primers generated from the intronic sequences of the human gene have 
been prepared (Figure 20A). The sequences of the primers follows (F and R refer 
5 to forward and reverse, respectively): 

HOB lgF 5'-CCCAAGAAGCCCATCCTG-3' (SEQ ID NO:26) 

HOB lgR 5 '-GACTATCTGGGTCCAGTGCC-3 ' (SEQ ID NO:27) 

HOB 2gF 5 '-CCACATGCTGAGCACTTGTT-3 ' (SEQ ID NO:28) 

HOB 2gR 5 '-CTTCAATCCTGGAGATACCTGG-3 ' (SEQ ID NO:29) 

01 10 DNA samples have been obtained from various sources, and these sets of primers 

are being used to amplify human genomic DNA from severely obese people. The 
PCR products were run on a low melting point agarose gel, and the bands were 
cut out and digested with agarase. The sequences were obtained using the ABI 
373 A DNA sequencer and Taq dideoxy terminator kit (abi, Perkin- Elmer). One 
15 point mutation in an ob gene from a patient sample has been detected to date. 
This mutation is on the first exon and does not change the amino acid sequence. 
Preliminary data indicate that an insertion sequence may be present in the first 
exon of another patient. 

A different automated sequencing method with Sequenase instead of Taq DNA 
20 polymerase may be employed to yield more easily readable sequences for mutation 
detection. 

EXAMPLE: Expression of ob in Yeast 



25 



Following the positional cloning of ob y it became important to uncover the 
physiological mechanism by which the ob protein reduces food intake and body 
weight. The first step in this direction was to recombinantly produce a functional 
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protein using an expression system. In addition to the successful bacterial 
expression system, a yeast expression system was also selected. Yeast expression 
has several attractive features for expressing ob. The most important is that 
biologically active eukaryotic proteins are more likely to be produced. The ob 
polypeptide is secreted by mammalian cells. Protein secretion is very similar for 
all eukaryotes, which means that the yeast secretory apparatus is much more 
similar to the mammalian secretory pathway than bacterial secretory pathways 
would be. In particular, protein modifications of ob seen in mammalian cells 
would likely also be seen in the expression through the yeast secretory system. In 
addition, protein folding is carried out in passage through the secretory apparatus 
and thus delivering ob through the yeast secretory apparatus is likely to give a 
properly folded protein with native biological activity. This is significant for ob 
because the two cystein residues may form a disulfide bridge. In contrast to 
secretory pathways, the reducing environment of the cell cytoplasm prevents 
formation of disulfide bridges, and therefore it is essential that ob pass through the 
secretory pathway in order for this disulfide bond to form in vivo. Other 
advantages have to do with the ease and quickness of manipulating yeast, the 
availability of vectors and strains, and the vast experience in yeast recombinant 
technology. 

A Pichia pastoris expression system was chosen for four reasons: (1) it has 
higher levels of heterologous protein expression than other yeast systems such as 
S. cerevisiae\ (2) protein glycosylation is more similar to the mammalian system in 
P. pastoris than S. cerevisiae (although glycosylation sites were not detected in ob 
using a computer search, there still remained the possibility of glycosylation at 
unrecognized sites); (3) P. pastoris secretes very few proteins natively, and thus it 
is generally straightforward to purify the expressed foreign protein; and (4) the 
vectors and yeast strains are commercially available (from Invitrogen). Two 
strategies for generating yeast expression vectors are shown in Figures 21 and 22. 
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The vector chosen was pPIC.9. This vector contains a cloning site just 
downstream of the alpha-mating factor prepro coding sequence which directs the 
protein encoded by the gene cloned into the cloning site to be secreted by the 
secretory pathway. The other important feature of the vector is a HIS4 gene that 
5 allows selection for uptake of the vector using a yeast auxotrophic strain grown on 
histidine-deficient media following transformation of the yeast with the vector. 
The cloning strategy was the following: PCR amplify ob cDNA using a 5' primer 
that contained at its 3' end sequence complementary to the sequence of ob just 
following the predicted leader peptide cleavage site, and at its most 5' end a 
10 sequence complementary to the 3' end of the alpha-mating factor sequence of the 
vector. The 5' primer also contains an Xhol site. The 3' primer was designed to 
have at its 3' end a sequence complementary to the last few amino acids of ob and 
an EcoRI site at its 5' end. Following PCR amplification, the PCR product was 
digested with Xhol and EcoRI and cloned into similarly digested pPIC.9. 
15 Following the cloning of both the mouse and human ob cDNAs, each with and 
without the glutamine at codon 49, individual clones were isolated for all four 
individual constructs and sequenced to verify that the constructs were cloned in the 
correct orientation and frame and contained no mutations from the PCR 
amplification step. Following identification of clones with the correct sequence, 
these were transformed into P. pastoris strain GS115, a histidine auxotroph. 

For the two mouse ob constructs, transformed yeast clones were screened for 
protein expression. As evidence that the transformed yeast contain ob, a DNA 
dot-blot assay and a colony hybridization assay were done which both showed ob 
sequence within the transformed yeast but not within the untransformed yeast. 
Furthermore, the transformed yeast now secreted a 16 kDa protein into the culture 
media whereas the untransformed yeast does not secrete a protein of this size 
(Figure 23 A). This is the predicted size of ob. Individual clones for both mouse 
constructs have been identified that are high expressors for ob, and currently a 
purification strategy is being developed to purify ob to homogeneity. One strategy 
has been to purify ob on a cation exchange column (Figure 23B); preliminary data 
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suggest that a strong cation exchanger may be useful. However, after cation 
exchange chromatography, the putative ob product is lost. This indicates the 
presence of a protease in the sample. 

One strategy to overcome this problem is to prepare ob-His tag fusions for 
5 expression in yeast (Figure 22). Further evaluation has demonstrated that ob 

without a His tag associates tightly with a Ni-chelation column. Purification of the 
ob polypeptide by Ni-chelation, followed by gel filtration, yielded a product of 
sufficient purity for mass spectral analysis. Mass spec confirms the molecular 
weight of the expressed protein is identical to the expected molecular weight, 
10 which strongly confirms that ob has been successfully expressed in Pichia. 

However, the Ni-chelation/gel filtration purification protocol does not yield a ob 
polypeptide in. sufficiently pure form. Additional small molecules are present. It 
does appear that the proteolytic activity elutes from the Ni-chelation column in the 
void volume. Accordingly, a three step purification process is planned: Ni- 
chelation, followed by cation exchange (which eliminates the small molecule 
contaminants), followed by gel filtration. 

Estimating expression level by Coomassie blue staining of SDS-PAGE gels reveals 
approximately 10 mg/L when yeast are grown in shaker flasks. These levels are 
expected to increase in fermentation vessels, and we are about to initiate 
fermentation with the hopes of obtaining larger quantities of protein. Regarding 
the human ob constructs, transformed yeast clones containing high copy numbers 
of the ob gene have been identified, and these are expected to express ob protein. 
As antibodies are developed, these will be used to confirm the identity of the 
secreted 16 kDa protein. 

25 Once purified the expressed protein is characterized by several methods. Physical 
characterization includes light-scattering to determine homogeneity of structure and 
is used as a measure of proper folding. Circular dichroism is used to roughly 
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determine the structural geometry of the protein. Importantly, bioactivity of the 
purified protein is assayed by administering the protein to both elan and obese 
rodents via an osmotic pump (e.g. , an ALZET osmotic pump from Alza 
Corporation, Palo Alto, CA) over at least a two-week period and observing effects 
on feeding behavior and body weight. 

The following is a list of references related to the above disclosure and particularly 
to the experimental procedures and discussions. 

Bahary, N.; G. Zorich; J.D. Pachter; R. L. Leibel; and J.M. Friedman. 1991. 
Molecular genetic linkage maps of mouse chromosomes 4 and 6. Genomics 
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Bahary, N.; D. McGraw; R.L. Leibel; and J.M. Friedman. 1991. Chromosomal 
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the mouse db gene. Submitted. 
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l(suppl): s51-s78. 
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Truett, G.E.; N. Bahary; J. M. Friedman; and R. L. Leibel. 1991. The rat 
obesity fatty (fa) maps to chromosome 5: Evidence for homology with the mouse 
gene diabetes (db). Proc. Natl. Acad. Sci. USA 88:7806-7809. 

This invention may be embodied in other forms or carried out in other ways 
5 without departing from the spirit or essential characteristics thereof. The present 
disclosure is therefore to be considered as in all respects illustrative and not 
restrictive, the scope of the invention being indicated by the appended Claims, and 
all changes which come within the meaning and range of equivalency are intended 
to be embraced therein. 

Various references are cited throughout this specification, each of which is 
incorporated herein by reference in its entirety. 
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